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Description 

Background of the invention 

s The Immunoglobulin (Ig) Gene Superfamily is comprised of numerous cell surface and soluble molecules that 

mediate recognition, adhesion or binding functions in vertebrates. (Abbas, A.K. et al.. CELLULAR AND MOLECULAR 
IMMUNOLOGY, p. 144 (1991 )). Members of the Ig Superfamily have an evolutionary relationship and share significant 
amino acid sequence and structural similarities. (Williams. A. F. and Barclay, A. N., IMMUNOGLOBULIN GENES, p. 
372 (1989)). Two criteria for membership within the family are: 1)sequence homology with Igor Ig-related polypeptide 

10 domains, which are approximately 70-110 amino acid residues long, and 2) key structural features which include the 
polypeptide domains comprised of a sandwich arrangement of two p-sheets. each made up of four or five anti-parallel 
p-strands of five to ten amino acid residues. (Abbas, A.K., et_aK, CELLULAR AND MOLECULAR IMMUNOLOGY PP. 
H4-145 (1991)). 

The Ig Superfamily domains are classified as either variable (V) or constant (C) based on characteristics of the p- 

IS strands within the p-sheet sandwich. (Abbas, A. K.. eLiL CELLULAR AND MOLECULAR IMMUNOLOGY p. 144 
(1 991 )). For example, in one class of Superfamily molecules, the immunoglobulins, V domains are at the amino-terminal 
ends of separate "heavy" (H) and "light" (L) chains, succeeded in the polypeptide chains by constant (C) domains. 
Thus, in an immunogbbulin, a V domain is defined as either Vh or (Figure 1). In the T cell receptor, V and C refer 
to Ig-like variable and constant domains which are comprised of polypeptide a and p chains. In other Superfamily 

20 molecules, V and C domains may be comprised of y, 5, or e chains. 

In the Ig Superfamily, the polypeptide chains that comprise the V regions, associate to form ligand binding sites. 
For example, in the immunoglobulin molecule, the Vh and Vl domains associate to form the variable fragment (Fv) 
region which comprises the antibody binding site. The Fv region includes both scaffold-like regions, termed framework 
regions (FRs), and regions of hyper-variability, termed complementarity-detemnining regions (CDRs). It is the CDRs 

25 that contribute to the unique antigen specificity of immunoglobulins. (Abbas, A. K.. et al.. CELLULAR AND MOLECU- 
LAR IMMUNOLOGY p.45 (1991)). Under special circumstances, the Fv region has been proteolytically dissected from 
its parent Ig to yield a variable-region fragment (Fv fragment) that is comprised of two non-covalently associated do- 
mains (Vh-Vl. a heterodimer). This heterodimeric Fv fragment can further dissociate into single Vl and Vh domains. 
(Huston, J.S. et al. Meth. Enzymol. 203:46-88 (1991)). 

30 Recently, protein engineering methods have been used to link Vh and Vl chains, creating a functional single-chain 

Fv (sFv), which has a solitary antibody binding site that does not dissociate into single domains at low concentrations. 
(Huston, J.S., et al., Proc. Natl. Acad. Sci. USA 85:5879-5883 (August 1988)). 

In this approach, the genes encoding Vh and Vl domains of a given antibody are connected at the DNA level by 
an appropriate nucleotide sequence, and on translation, this gene forms a single polypeptide chain with a peptide linker 

35 bridging the two variable domains. (Huston. J.S. . et al.. Meth. Enzymol. . 203:46-88 (1 991 )). 

Summary of the Invention 

The present invention relates to a chimeric immunoglobulin (Ig) Superfamily protein analogue having more than 

40 one biologically active binding site. Hereinafter, the term multivalent will be used to describe these multiple binding 
sites. The chimeric multivalent Ig Superfamily protein analogue, hereinafter referred to as a CHI-protein, or x-protein, 
is comprised of one or more polypeptide chains forming a p-barrel domain. A single p-barrel domain may comprise a 
chimeric protein binding domain with more than one binding site. Altematively, more than one p-barrel domain, such 
as the Vl and Vh p-barrel domains In immunoglobulins, may combine to form a larger concentric p-barrel domain 

45 . having more than one binding site. 

The p-barrel domain(s) comprising the binding regions has amino acid sequence and structural homology with 
variable regions of molecules related to the Ig Superfamily of molecules. Specifically, the binding sites on the x-protein 
are comprised of hypen/ariable regions derived from molecules related to the Ig Superfamily of molecules. The Ig 
Superfamily includes immunoglobulins, cell surface antigens, such as T lymphocyte antigens, and cell surface recep- 

50 tors, such as immunoglobulin Fc receptors. 

In a preferred embodiment, the hypervariable regions are complementarity-determining regions (CDRs) derived 
from the antigen binding sites of immunoglobulins. In this embodiment, the multivalent protein analogue comprises 
one or more polypeptide chains forming a p-barrel domain containing CDRs Interspersed between framework regions 
(FRs). These CDRs define one antigen binding site. This multivalent protein analogue also has one or more additional 

55 antigen binding sites spliced into the FRs of the p-barrel domain. 

In one embodiment, the x-protein will comprise a single polypeptide chain forming a p-barrel domain. In another 
embodiment the x-protein will comprise a single polypeptide chain comprised of two polypeptide chains, connected 
by a polypeptide linker spanning the distance between the C-terminus of one chain to the N-terminus of the other chain 
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forming a p-barrel domain. In yet another embodiment, the x-protein will comprise two polypeptide chains with two 
non-covalently associated chains forming a p-barrel domain. In each of the above embodiments, the polypeptide chains 
fold to form a p-barrel domain with two or more binding sites. 

The invention also relates to the amino acid sequences that encode the x-protein, the DNA sequences that encode 

5 the amino acid residues fomiing the x-protein, and expression vectors comprising and capable of expressing the DNA 
sequences. The invention also relates to methods of producing the x-Proteln. 

The invention further relates to compositions comprised of the x-proteins and methods of use thereof. These 
compositions are comprised of x-proteins having two biologically active binding sites and may be used in a variety of. 
therapeutic and diagnostic procedures. These compositions include, but are not limited to, x-protein biosensors which 

10 undergo a conformational change when a ligand is bound to one binding site such that the affinity of the second binding 
site is modified; and x-proteins having one binding site reactive with a tissue-specific ligand and the second binding 
site reactive with radioactive bns, radio-opaque substances, cytotoxic substances, cytotoxic effector cells (e.g. cyto- 
toxic T cells) drugs, or catalytic substances. These compositions also include a "biochip" comprised of a two-dimen- 
sional array of aggregated x-protein biosensors, such as in a Langmuir-Blodgett film, to make a tunctionalized mem- 

is brane useful for layers of molecular gates for computers and the like. 

The utility of binding proteins having two independent binding sites of different specificity for the treatment or control 
of tumors, virus infected cells, bacteria and other pathogenic states has been recognized (Segal. D.M. and Snider, D. 
P.. Chem. Immunol. 47:179-213 (1989)). Bispecific binding proteins have been produced by crosslinking two or more 
dissimilar but intact antibodies with a chemical agent (heteroantibodies); crosslinking antibody fragments; linking two 

20 ' single chain antibodies; or fusing a single chain antibody to an effector molecule (Segal etal U.S. 4.676,980), Tai. M., 
et al. Biochem. 29:8024-8030 (1990). 

However, despite considerable application to medical research, these previous attempts to produce bispecific 
binding proteins suffer from the difficulty of complex purifications and large molecular size. For example, each binding 
region of an IgG is at least 50 kD, so that a conventional crosslinked, bispecific heteroantibody can range in mass from 

25 100-150 kD to as much as 300 kD, if two intact IgG antibodies are crosslinked. Even the single chain antibody has a 
mass of approximately 26 kD so that a single polypeptide chain comprising two separate binding regions, each com- 
prised of a single chain antibody, has a mass of approximately 50 kD. 

The recombinant-engineered x-proteins of the present invention have significant advantages over conventionally 
crosslinked antibodies, or even the single-chain Fv The proteins of the present invention can be custom-designed to 

30 bind specific ligands and cell surface receptors with affinity or specificity. These custom-designed multivalent binding 
proteins can be smaller and more compact in size than intact hetero-bispecific antibodies, Fab' antibody binding frag- 
ments or bispecific sFv-sFv constructs. 

These x-proteins can also be less immunogenic thereby reducing the likelihood of immune reactions to such ther- 
apeutic compositions. For example, in the intact immunoglobulin molecule, the bottom loops of the variable region are 

35 sterically protected from recognition resulting in an immune response. However, when disassociated from the protecting 
constant region, as in the Fv. the exposed bottom loop region potentially becomes antigenic. In ax-protein, this bottom 
loop region is no longer exposed and thus significantly reduces the likelihood of an Immune response. Furthermore, 
due to their smaller size, the x-proteins can have enhanced stability when administered intravenously as they will be 
less susceptible to proteolysis by endogenous proteases than larger, multi-domain proteins. 

40 

Brief Description of the Drawings 

The foregoing features and advantages of the invention will be apparent from the following more particular de- 
scription in the following drawings and text. 
45 Figure 1 is a schematic representatton of a typical Ig Superfamily molecule, an immunoglobulin, depicting the 

variable (V) and constant (C) regions. 

Figure 2A is a schematic representation of an Fv depicting the relative positions of the top loops (TL) and bottom 
loops (BL) of the folded heavy and light chains. The TLs typically form the CDRs of the Fv and the BU are typically 
adjacent to the C region when within an intact immunoglobulin. The BLs comprise the loops suitable for splicing on 
50 the second binding site. The shaded strands are the inner strands (IS). The unshaded strands are the otiter strands 
(OS). 

Figure 2B shows an alignment of Vl and Vh amino acid sequences for which three dimensional structures are 
known (SEQ ID NOB: 1-9 for Vl and SEQ ID NOS 10-16 for V^). The alignment of these sequences is based on the 
structural homology that exists between them, especially In the p-strand regions. Regions of the sequence correspond- 
55 ing to structural regions of inner-p-strand (IS), outer-p-strand (OS), top loops (TL) and bottom loops (BL) are identified. 
The number scale corresponds to structural position, especially in the OS and IS regions. 

Figure 3A is a stereo figure depicting two copies of the McPC 603 Fv structure wherein the top loops of one (right 
side up) structure are superimposed with the bottom kxjps of the other (upside down) structure. For clarity, only the 
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top loops (ribbons) and bottom loops (line trace) are shown. 

Figure 3B is a stereo figure depicting the alignment of H2 with the N- and C-termina! strands of the Inverted Fv 
structure. 

Figures 4A-4G depict in stereo the positions of the native CDRs and the CDRs spliced into the McPC 603 bottom 
5 loops to form a additional binding site (x-slte). Corresponding native and x-site CDR loops are highlighted as ribbons. 

Figure 5 depicts the stereo comparison of the x-protein comprised of native McPC603 with McPC603 CDRs spliced 
into the BLs of the native McPC603. 

Figure 6 depicts the splice points used when H3 of a second McPC603 is spliced into H BL2 of the native McPC603 
sFv. The ribbon follows the spliced trace. 
10 Figure 7 depicts the splice points used when the L3 loop Is spliced into the L LB2 of the native McPC603. 

Figure 8 depicts the splice points used when the H1 loop is spliced into H BL4 of the native McPC603. 
Figure 9 depicts the splice points used when the LI loop is spliced into L BL4 of the native McPC603. 
Figure 10 depicts the splicing of the H2 loop into the C-terminus of Vh of native McPC603. 
Figure 11 depicts the splicing of the L2' loop onto the C-terminus of Vl of native McPC603. 
IS Figure 1 2 shows the final amino acid sequence of the x-protein comprised of two McPC603 binding sites as con- 

structed according to Examples 1 and 2 (SEQ ID NO: 17). 

Figures 1 3 A and B show the alignment of consensus sequences for the various sequence classes In the Kabat 
et al. compendium with the alignment of sequences of known structure for the Vh (Figure 13A) and Vl (Figure 13B) 
chains; (Kabat, E.A., SEQUENCES OF PROTEINS OF IMfy/IUNOLOGICAL INTEREST Vols. I-Ill, U.S. Dept. Health 
20 and Human Services. NIH Pub. No. 91 -3242 (1 991)). The residue preference list in Table 2 is derived from this figure. 
The first group of sequences are as in Figure 2B (SEQ ID NOS: 1-16). The second group of sequences are consensus 
sequences from the groupings in Kabat etaL (Figure 13A, SEQ ID NOS: 18-28) and Figure 13B, SEQ ID NOS: 31-38). 
The third group of sequences are known sequences for mouse immunoglobulins 2610 and GLOOP4. (Figure 13A. 
SEQ ID NOS: 29-30 and Figure 1 3B. SEQ ID NOS: 39-40). The line labeled KABAT indicates the boundaries of f rame- 
2S work and CDR regions as defined in Kabat et ai. ("-" indicate alignment gaps). 

Figure 14 shows the sequences of x-protein constructs as constructed according to Examples 2 and 3 (SEQ ID 
NOS: 41-44). 

Figure 15A shows the nucleic acid and amino acid sequences of the x-1 protein, with double dashes indicating 
the D1.3H1 X'loop insertion in the 26-10 sFv (SEQ ID NOS: 45 AND 46, respectively). 
30 Figure 15B shows the nucleic acid and amino acid sequences of the x-2 protein, with double dashes indicating 

the D1.3H1 and H2 x-loop insertions in the 26-10 sFv (SEQ ID NOS: 47 AND 48, respectively). 

Figure 16A shows the SDS polyacrylamide gel electrophoresis of the x-1 protein purified by ouabain-Sepharose 
affinity chromatograph. 

Figure 16B shows the SDS polyacrylamide gel electrophoresis of the x-2 protein purified by ouabain-Sepharose 
35 affinity chromatography. 

Figure 17 shows the alution profiles of the x-2 protein and the 26-10 sFv from a Superdex 75 column. 

Detailed Description of the Inventbn 

40 The present invention relates to a chimeric multivalent immunoglobulin (Ig) Superfamily protein analogue, herein- 

after referred to as a x-Protein. comprising one or more polypeptide chains forming a p-barrel donnain. The p-barrel 
domain, contains hypervariable regions (hereinafter called complementarity-determining region-like (CDR-like region)) 
and structural regions (hereinafter called framework-region-like (FR-like region)). The CDR-like regions define ligand 
binding sites. Additionally, the x-protein has at least one more ligand binding site segment spliced into the FR-like 

45 regions of the p-barrel domain. 

The Ig Superfamily molecules show significant amino acid sequence homology within their p-barrel domains. (Wil- 
liams. A.F. and Barclay, A.N., IMMUNOGLOBULIN GENES p. 362 (1989)). The amino acid sequences of the amino 
terminal domains are called variable (V) regions and the more conserved sequences ot the remainder of the chain, 
termed the constant (C) region. (Figure 1) (Abbas. A. K.,etaL CELLULAR AND MOLECULAR IMMUNOLOGY p. 45 

so (1991)). The amino acid sequences of both the V and C regions of Superfamily molecules are formed on two different 
polypeptide chains, the heavy (H) chain and the light (L) chain in immunoglobulins, or the a, p, y, 6, or e chains in other 
Ig Superfamily molecules. These chains also fold Into V and C regions. For example, each H chain of an immunoglobulin 
molecule folds into a Vh domain with an adjacent Ch domain and each L chain folds into a Vl domain with an adjacent 
Cl- Each chain has additional successive constant regions (Figure 1). 

55 The V region contains highly variable, unconsented stretches of sequence called the hypervariable regions. More 

conserved stretches of sequence are called structural regions. In immunoglobulins, the hypervariable regions are called 
complementarity-determining regions (CDRs) and the structural regions are called framework regions (FRs). Three 
CDRs of each Vh, and three CDRs of each Vl combine in a unique three-dimensional structure to form the antigen or 
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ligand binding site. These CDRs determine tlie ligand specificity of the protein. (Abbas, A. K.. et al.. CELLULAR AND 
MOLECULAR IMMUNOLOGY p.143 (1991)). 

Hereinafter, the hypervariable regions of all Ig Superfamily molecules will be called CDR-like and all conserved regions 
will be called FR-tike (or CDRs and FRs in the specific case of an immunoglobulin molecule). 

s The Ig Superlamily members also show significant homology in the structural three-dimensional features of their 

Vand C domains. This structural feature is known as the Ig-fold. (Williams, A F. and Barclay, A.N., IMMUNOGLOBULIN 
GENES p. 362 (1 989)). The Ig-fold consists of a sandwich of two p-sheets constructed from anti-parallel p-strands, 
each strand containing five to ten amino acid residues. The V domain differs from the C domain by an extra pair of p- 
strands in the middle of the V domain. (Williams, A.R and Barclay, A.N.. IMMUNOGLOBULIN GENES p. 362 (1 989)). 

10 For example, the V domains or C domains of an immunoglobulin, associate in pairs, such as Vh^Vl- Each p-barrel 
domain, when associated in such a pair, forms two concentric p-barrels, with the CDR loops connecting anti-parallel 
p-strands of the inner barret. 

Members of the Ig Superfamily Include, but are not limited to, Immunoglobulins, T cell Receptor Complex, Major 
Histocompatibility Complex Antigens, P2 Microglobulin-associated Antigens, T Lymphocyte Antigens, Growth Factor 

IS Receptors, and Neural Cell Adhesion Molecules (NCAM). (Williams. A.R and Barclay, A.N., IMMUNOGLOBULIN 
GENES p. 362(1989)). 

Each ligand binding site of the x-protein is comprised of the CDR-like region derived from molecules of the Ig 
superfamily. For example, a ligand binding site could be comprised of the CDRs derived from an immunoglobulin 
molecule whose ligand is an antigen. Alternatively, the ligand binding site could be comprised of CDR-like regions from 

20 a receptor molecule such as the T cell receptor whose ligand is the antigen-major histocompatibility complex (MHC) 
molecule which, upon binding to its receptor, initiates T cell activation. 

In particular, the present invention relates to the immunoglobulins of the Ig Superfamily In natural immunoglobulins, 
the antibody combining site is formed by CDRs of the Vh and Vl variable domains within the Fv (variable region 
consisting of noncovalently associated and Vl or V^* vj, as shown in Figure 1 . In addition to the CDRs determining 

2S binding specificity, maintenance of the tertiary structure is also necessary for biological activity. The CDRs are correctly 
positioned by the conserved framework regions (FRs) within the V regions, and the V region is further stabilized by 
the C regions of the protein. However, the minimal naturally-occurring antibody binding site is the two chain, non- 
covalently associated Fv. 

Recently, through recombinant protein engineering, a single-chain Fv (sFv) has been constructed. In this approach, 

30 the genes encoding and Vl domains of a given antibody are connected at the DNA level by an appropriate oligo- 
nucleotide linker and, on translation, this gene forms a single polypeptide chain with a peptide linker bridging the two 
variable domains. (Huston, J.S. et aL. Meth. Enzvmol. 203:46-48 (1 991 )). 

Isolated single domain antibodies have also recently been constructed, comprised of only three CDRs instead of 
the usual complerrient of six. (Ward, E. S:, et al.. Nature 341: 644-546 (1989)). In some cases, these single domain 

35 antibodies (i.e., V^ or Vl) exhibit binding activities comparable to their parent antibodies. 

In a preferred embodiment of the present invention, one of the x-Protein binding sites is comprised of a set of 
CDRs from a mouse myeloma protein such as 26-10 or McPC603 (Huston, J.S., et al. Methods Enzvmol. 203: 46-88 
(1991). An additional binding site segment is spliced into the P-barrel domain. This spliced segment is comprised of 
CDRs from the same, or a second mouse myeloma protein, which are spliced into the bottom FR loops of the p-barrel 

40 domain, or attached to the C-terminal ends of each V domain. 

The structural basis for this invention can be explained with reference to Figure 2A and 2B (SEQ ID NOS: 1-16), 
where the typical variable region composition of an immunoglobulin is noted. There are three CDRs within each V^ 
and Vl which together constitute a complete antibody binding site. These CDRs are interspersed between FRs such 
that the light chain V region is denoted FR1-L1-FR2-L2-FR3-L3-FR4 and the heavy chain is denoted 

45 FR1 -H 1 -FR2-H2-FR3-H3-FR4. Each of these V region chains folds into a native conformation that comprises a double 
layer of p-sheets. V domains may be monomeric, (Vh or Vl), or associate into homodimers, (Vh»Vh or Vl»Vl), or into 
the heterodimeric Fv, (Vho Vl). Each of these dimers may be constructed as single-chain analogues. In the single-chain 
Fv, these two sequences are connected in tandem with a bridging linker to make, for example, V^-linker-VL (V^-Vl) 
or VL-linker-VH (Vl-Vh). 

50 In these folded configurations of the V region polypeptide, alternating directions of the p-strands loop out at either 

end as they form anti-parallel strands. In each region the immunoglobulin fold has loops on top (the binding site is "on 
top") and on the bottom (partly in close proximity to constant domains for intact H and L chains). 

On top of each V domain, four loops are present of which three are CDRs that contribute to the antigen binding 
site, and on the bottom, four loops are present that allow the p-strands to switch directions and fold back into the 

55 globular domain. These bottom loops are here termed BL1, BL2, BL3, and BL4, and apply to the L and H variable 
regions, respectively called L BL1-4 and H BL1-4 (Figure 2A). These bottom loops provide insertion sites for fusion 
proteins or peptide segments, which are further augmented by splicing of peptide sequence at the C-terminus of each 
V regfon. The use of these bottom loops as splice sites permits incorporation of alternate binding, catalytb or effector 
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sites which complement the naturally present antigen binding site. ^,■.„^,„„ 

However, novel demands are made on variable region archttecture in order to construct acW.t|onal b^duig srtes 
on the bottom of a V domain. In particular, there is a requirement that the relative orientation of the CDR-like toops 
(CDR or top loop symmetry) for an Fv region be reproduced to a reasonable approximation in the relatwe onentation 
of the bottom lois of the Fv (bottom loop symmetry). Whetheror not this symmetryrelatiori^ 

question to ask and the possibility of top-bottom loop symmetry has not been P^°^'°"= ; "^^"^^^^^^^^ 
molecular modellngandcomputatlonalana^slsdemonstratethatsuch an approximation totheCDR-hkeloopsymmeto^ 

does exist for the bottom loops of the Fv region. „^„„;„,„ 

The superposition of top and bottom loops involves two interrelated obsen/ations: (1 ) the wo fixed endpoints d 
each top l^,p (CDR) must find a corresponding match with bottom loop endpoints; and (2) '^e Polypeptide c^^^^^^ 
directionalrty must be maintained after bottom loop splicing (i.e.. N to C directionality of the spliced CDR mus be he 
same as the bottom loop that it replaces). The overall architecture of the Fv region appears asymmetnc because he 
ends of the light and heavy chain p-barrels are closer together at the top than they are at the bottom. However, much 
onhis apiSnt differencrder-^es f rom there being one quarter of the Fv residues In CDRs on top of the framework, 
which fill in space that is open on the bottom. . , 

in fact, there is sufficient correlation between top and bottom loop symmetry of the '"tact ^v region to^^^^^^^ 
multiple binding srte molecule possible. This is discernible if one superimposes the top of one Fv (#1) with he b«tom 
of Mother Fv (#2); H #1 and #2 are the same, then the correlation helps decide how to construct a multivalent Fv^ 
whereas if #1 and #2 are different, the additional binding site (x-site) is distinct from the native Fv binding site and 
overall one thereby designs a chimeric multivalent Ig Superfamily protein analogue which has one or more sites of 

'^'"^W? XeSent the composrtion of the x-site with parentheses so that A(B) represents a x-protein comprising 
the CDR-like regions of immunoglobulin Superfamily molecule B built into the x-site of immunoglobuhn Superfam,^ 
moleculeoranalogue A. Likewise (VH(VJrepresentsaVHdomainwithax-site whose loops are derived from^^^^^^ 

loops of a Vl domain. Consequently the designation A(B) Vh(VO-Vl(Vh) denotes a x-prote,n based °n th« FR and 
CDR of Fv A having a x-site based on the CDR loops of Fv B where the x-site loops on the heavy chain ofA are derived 
fio^the CDR loops ofthe light chain B and wsa versa, and the two x domains are linked by a po^peptide linker such 
that the heaw chain FR domain precedes the light chain FR domain. 

AS show^ in Figure 3A. by applying these procedures to McPC603 for both Fv #1 and #2. the following pairs of 

loops align: 

1 . The TL4 loops (L3 and H3) of Fv#1 can be aligned very closely with the BL2 loops of Fv#2. and with the alignment 
of these two sets of loops, additional alignments become apparent. 

2. The TL1 loops (LI and HI) are approximately superimposed on the respective BL4 loops. 

3. in addition, as shown in Figure 3B, the ends of the TL2 kjops (L2 and H2) are approximately superimposed on 
the N-terminal and C-terminal strands of the respective ^-barrel domains. 

In this structural alignment, all of the fundamental design criteria for splicing a second binding site onto the bottom 
of V Fv or sFv regions are satisfied: the proper chain directionalities and superpositions are found for the Tl^4 - BL2 
oair' the TL1 - BL4 pair, and the ends of the TL2 loops relative to the N- and C-temilnal strands of the p-barrel region. 

"This construction is consistent to a good approximation with the natural geometry for all CDR loops, thereby gen- 
erating a ligand binding site on the bottom of the p-barrel domain similar to the "source" Fv binding sitejim ilarly an 
additi^al Sgand binding site may be buitt on the separate non..ovalently linked V domains of a heterodimem^^Fv (i- 
e V„.V, ) Alternatively, as it has recent^r been shown that the full set of 6 CDRs is not necessary for ligand binding, 
partial binding sites (i.e. , fewer than 6 CDRs) may be assembled on a single Vdomain. Thus, ax-protein could comprise 
KlTba'elU^^^ 

^"^^'hVi or L BL1 loop replacement could also be useful, as they are In proximity to the rest of the binding site. 
However, their peptide chain directionality is opposite to what would be needed to correctly splice in H2 and L2 loops. 
Nonetheless such loops could be devised de novo by design or mutagenesis to facilitate such replacement. 

Furthemiore. in most Fv regions, the H or L BL3 loops are located on the sides of the V domains. 
Although these k»ps are not contiguous with the rest of the additional binding site, ancillary loops could be attached 
at such sites, thereby providing the addition of a "ligand-like" surface feature for recognition by the appropriate receptor, 
thus, forming a x-protein with more than two binding sites. A single substitution of only one loop with an appropnate 
peptide, could provide a means to anchor the binding protein to a particular receptor. Moreover the H or L BL3 (or H 
or L TL3) loops could provkJe the means to crosslink single domain x-protelns into aggregate sheets to form two 
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dimensional arrays of x-proteins. 

CDR-like region sequences of any of the proteins Irom the Ig Superfamily showing the requisite sequence and 
structural homology can be spliced into the bottom loops of the source Fv. It should be noted that CDR-like region 
sequences from other Ig Superfamily molecules can also replace all or a part of the native CDR-like region of the x* 
s protein. Thus a x-protein can comprise the CDR-like regions from molecule A, the FR-like regions from molecule B 
and the x-site having CDR-like regions from molecule C spliced In. 

As discussed in detail in Examples 1 and 2, using the Fv derived from mouse myeloma IgA antibody, McPC 603. 
a second McPC 603 binding site can be constructed on the bottom of the source Fv. Although the Fv region of McPC 
603 is exemplified, as shown in Example 3, it is reasonable to splice in a CDR-like region from other Ig Superfamily 
10 proteins, due to the sequence and structural homology that exists among these proteins. (Figure 2B (SEQ ID NOS: 
1-16)). 

When CDR-like sequences are spliced into the lower FR-like loops of the source Fv, It is important to preserve 
those FR-like residues that are critical for maintaining the proper folding. These critical FR-IIke residues are located in 
the stems of the loops and underlying the CDR-like regions. Three criteria govern the selection of the proper splice 

IS points. First, as discussed above, the need to preserve critical FR-like residues; second, the desire to incorporate as 
much of the CDR-like sequence as possible; and third, the practical need to switch from one backbone to the other at 
points where the alpha carbons of the two chains are reasonably well aligned. The last requirement is necessary in 
order for the spliced loops to maintain their native, biologically active conformation. 

As described in detail in Examples 1 and 2, the crystallographically determined coordinates of a mouse myeloma 

20 protein structure, such as McPC603, are visually displayed using a suitable computer graphics system. This display 
of the protein in three-dinriensions, can be spatially rotated, turned or twisted as necessary to view the polypeptide 
chains or peptide backbone of the Fv region. Using the graphics program, substitutions, additions or deletions can be 
made to the structure. These modifications are energy minimized (optimized) to account for steric hindrance, bond 
lengths, bond angles and energy constraints so as to maintain the critical tertiary structure necessary for biological 

25 binding activity. Thus, in a step by step process involving modification of the polypeptide chain and successive cycles 
of refinement calculations for each modification, a three-dimensional model of the x-protein, with two distinct binding 
sites, is built. 

As described In Examples 1 and 2, the model used was McPC603 sFv constructed as Vh-Vl- However, sFv proteins 
constructed as Vl-Vh, Vh-Vh, or Vl-Vl are also intended to be encompassed by this invention. Two chain Fv regions 

30 (i.e., non-covalently associated Vh and Vl domains, (Vh-VJ, as well as single domain antibodies (Vh or VJ are also 
intended to be encompassed by this Invention. In addition, any other Ig Superfamily molecule having the requisite 
sequence and structural homology are also intended to be encompassed by this Invention. 

Alternatively, since the sequence would be known for the Ig superfamily CDR-like regions of interest, alignment 
procedures that relate tertiary to primary structure are useful to predict successful x-proteins. Figures 13A (SEQ ID 

35 NOS: 10-16 and 18-30) and 8 (SEQ ID NOS: 1-9 and 31-40) depicts the amino acid residue alignment of Fv regions 
from mouse myeloma antibodies and other Ig superfamily molecules. As described In detail In Example 3, this type of 
sequence alignment allows approximate choice of splice points without having a three-dimensional crystallographic 
structure. The tertiary structures of framework regions are highly conserved, thus allowing reliable three-dimensional 
models to be predicated on sequence alignment methods. 

40 However, to correctly splice "target" CDRs-like regions Into the source Fv, some modifications to the source FR- 

llke region sequence may be necessary. Moreover, it may be desirable to modify the native CDR-like sequence which 
is spliced into the source Fv, or the source Fv itself, to modify binding affinity or specificity. Such modifications are 
Intended to be encompassed by the subject Invention. 

For example, one or more amino acid residues can be substituted by another amino acid of a similar polarity which 

45 acts as a functional equivalent, resulting in a silent alteration. Substitutes for an amino acid within the sequence may 
be selected from other members of the class to which the amino acid belongs, such as the nonpolar, polar, and positively 
or negatively charged amino acids. 

The structure may be modified by deletions, additions, substitutions and insertions of one or more amino acids 
which do not substantially detract from the desired functional properties of the x-protein. Naturally occurring allelic 

50 variations and modifications are included within the scope of this inventfon so long as the variation does not substantially 
reduce the ability of the x-proteIn to bind Its llgand. 

Based on the method described in Example 3, ax-protein has been partially constructed Incorporating two loops 
of an anti-tysozyme monoclonal antibody In the 26-10 sFv, as described In detail in Example 4. As shown In Figure 
15A and 15B (SEQ ID NOS: 45-48), constructions have been made which incorporate the HI and H2 loops of the D1 .3 

55 anti-lysozyme monoclonal antibody in the appropriate x sites at the bottom of 26-10 sFv, where they are designated 
HI ' and H2'. The design process has involved successive addition of x-loops, so that there are constructions with HI' 
alone (X-1 . Figure 15A). and HI' + H2' together (x-2. Figure 15B). In constructing x-1 and x-2, the only deviations from 
the corresponding partial constructs described in Example 3 (Figure 14, 2610(D1 .3) sequence) are the following: (1) 



7 



EP 0 640 130 B1 



ni^rnR^(H3' LV L2' and L3') can be accomplished by the same procedure. ^ , , . ^ 

A« .«^h CDR COD inserted in the x-site, the cloned proteins were expressed in EM and refolded. The 
f fh^ilTnd^o caoacftv bv hese v-protelns demonstrates that the insertion of these x-CDRs was com- 

enzymatic or chemical combinations of appropriate^, modified blocks of poptdes^ 

'"^AnToTL standard recombinant methods for the insertion of DNA into an «'^P^«^^;^°^^" ^7,;^^^^^^^ 

Sral mSiSrSunen^ post-translationai mod.cation could allow specif, incorporafon of mo.e- 

^o^S^^^^ di'ution refo«ing, redox refolding and disu«ide-restricted 

■ o . . kL h^c pLvmol 203-46-88 (1991) the teachings of which are hereby incorporated by reference). Dilution 
. i^SelirgSg^J^^^^^^^^^^ and dLtured -ibody frag-ts^^^^^^^^^ removal J de- 

naturSand reducing agent wrth recovery of specific binding actK,ity. (Haber, E.. Prnr Natl Acad. Sc. U.S.A., 53.524 

''rLx refdding utilizes a glutathkx,e redox couple to catatyze dfeuifide interchange as the protein refolds into its 
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native state. (Saxena. P. and Wettaufer. D.B.. Biochem.. 9:5051 (1971); (Huston, J.S., et al. Method s Enzvmol. 203: 
46-88(1991)). 

DIsulfide-restrlcted refolding involves initial formation of intrachain disulfides in the fully denatured protein. This 
capitalizes on the favored reversibility of antibody refolding when disulfides are kept Intact. (Buckley C. E.. et al .. Proc. 

5 Natl. Acad. Sci. U.S.A.. 50:827 (1963); Huston, J.S.. etal. Methods Enzvmol. 203:46-88 (1991)). Disulfide crosslinks 
should restrict the Initial refolding pathways available to the molecule. For chains with the correct disulfide pairing, the 
recovery of a native structure should be favored, while those chains with Incorrect disulfide pairs must necessarily 
produce nonnative species on removal of denaturant. 

Characterization of the multiple ligand binding sites requires that both their affinity and specificity be determined. 

10 Ideally, the measurement of binding affinity should use a thermodynamically rigorous approach such as equilibrium 
dialysis or ultrafiltration. In the absence of such methods, agreement between two distinct methods is desirable to 
reinforce the veracity of the data. Ligand binding to high affinity binding sites frequently involves very fast rates of 
binding and slow rates of dissociation. This characteristic makes measurement of their association constants amenable 
to routine immunoassay procedures, such as immunoprecipitation, radioimmunoassay and ELISA. (Huston, J.S., et 

IS al. Methods Enzvmol. 203:46-88 (1 991 )). 

After evaluating the binding activity of a particular model x-protein, it may be desirable to further refine the initial 
X-protein to enhance its biological activity. This refinement may be performed computationally by additional computer- 
ized modeling or it may be a "biological" refinement using recently developed techniques such as the use of genetically 
engineered bacteriophage to display and/or secrete the x-protein and thereby select for an improved confirmation. 

20 (Marks, J.D., etal., J. Mol. Biol. 222:581 -597 (1 991 )). 

Additional optimizing techniques, as described above, may also be used to confer ■special" properties on the %• 
protein, such as "humanizing" the x-protein using human FR-like regions as the scaffold protein to reduce the possibility 
of adverse immunologic reactions during therapy (Daugherty, B. L. et al.. Nucleic Acids Res. 19:2471-2476 (1991)). 
Special properties may also include increased solubility or stability or improved renaturability or secretion. 

25 This invention further relates to the method of producing the x-protein which Includes the steps of determining the 

splice points for additional binding sites, as described in Examples 2 and 3, determining the amino acid sequence of 
the resulting construct, deducing the DNA sequence encoding that amino acid sequence, inserting the DNA sequence 
into an appropriate expression vector and expressing the protein in a suitable host system, refolding the protein to Its 
biologically active conformation and analyzing its biological binding activity For example, in the case of a x-Protein 

30 with catalytic activity, this would include measurement of enzymatic properties. 

The ability to target therapeutic agents in a host with antibodies has been a long-term goal of medical research. 
The term host, as used hereinafter, is intended to encompass mammalian hosts, including humans. The most elegant 
targetable proteins would consist of the minimum structures needed for selective delivery and effector function. (Tai, 
M., et al.. Biochem. 29: 8024-8025 (1990)). Although the utility of binding proteins having multiple binding sites of 

35 different specificity has been widely recognized (U.S. Patent 4,676,980), these crosslinked antibodies or crosslinked 
antibody fragments suffer from the need for complex purification schemes and large molecular size. 

Notwithstanding the construction of the smallersized sFv, the need still exists for a multifunctional binding protein 
with as reduced a size and as compact a shape as possible to permit effective therapeutic use. The subject invention 
also relates to use of these x-proteins in therapeutic and diagnostic procedures. These uses include, but are not limited 

40 to, in vivo and in vitro imaging agents, delivery agents for drugs, radtoisotopes, and cytotoxic substances. The x-protein 
could also include a binding site for an effector molecule, such as an enzyme, growth factor, celldifferentiation factor, 
lymphokine, cytokine, hormone, anti-metabolitic, or an ion-sequestering sequence such as calmodulin. The x-protein 
could further Include a binding site reactive with antibody-dependent cytolytic cells or cytotoxic T cells. Also included 
are x-proteins with catalytic or biosensor activity, as well as binding sites that facilitate affinity purification procedures. 

45 In some instances, the primary binding site could be a high affinity site that targets the protein to specific cell 

surface locations, and the secondary site designed to decrease normal receptor activity. Ax-protein could be designed 
with one binding site comprising a receptor for a cell surface antigen and a second binding site comprised of ia modified 
receptor with diminished affinity for its ligand. Thus the x-protein would to bind a cell via the normal binding site, leaving 
the binding site with diminished binding activity exposed, thereby resulting in decreased receptor activity For example, 

so a x-protein could be designed with neural cell adhesion molecule (NCAM) variable regions that would modulate cell 
interactions such as contact inhibition of malignant cells. 

Ax-protein may also be designed to modify, enhance or inhibit cell-celt interactions, or communication. For exam- 
ple, one binding site could be designed to target a cancer cell and a second binding site could be designed to target 
an effector cell, such as a macrophage, thus binding the macrophage in a manner which results in destruction of the 

55 targeted cell. 

Moreover, a x-protein could mediate phenomena such as antibody-dependent cellular toxicity. (Huston, J.S., et 
al.. Proc. Natl. Acad. Sci. USA. 85:5879-5883 (1988)). For example, a x-proteIn could be designed with one binding 
site comprised of a receptor for a cell surface marker protein and a second site derived from a receptor for killer T cells. 
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Thex-protein would bind to the target cell and the secondary site would bind the cytotoxic T cell. ^«s""i"9 in destruction 
ot the target cell. The x-proteins of the present invention would exhibit signif icantty greater tissue accessibility or tumor 
penetration and fasterpharmacokinetics due to their compact size. . hinHinn rRh«t T IM 

Recently it has been shown that an Fv undergoes a conformational change upon antigen binding. jBhat, T. N., 
Nature, 347:483 (1990)). It is reasonable to predict, that the x-proteins of the present invention ^^'^ ""f^,^^;;)'^^ 
^^mational changes, due to the significant sequence and structural homology with the native Fv Although this 
crnforma onal change is modest, it involves the translational movement of the Vh and V, domains re at«,e to each 
oCsu^tha L oLtation of the bottom loops can sh«t. This makes the x-pr^^^ 

SSeti or ^^^^ binding sKe. and requires that linkage exists between their binding equilibria. It ,s reasonable 
to predict that this conformational change would enable the x-protein to act as ^ J^^l'^^l^ , 

For example, the first site of the x-protein may be a catalytic antibody combining site (e.g.. a site that catalyzes 
the conversion of a "pro-drug" to a cytotoxic drug), while a second site is specific for binding to a marker on a ceH 
surface (e g . a tumor cell). If binding to the cell surface epitope induces a conformational change in the x-protein such 
that the 4s of the catalytic srte are brought Wo optimal geometry for efficient catalysis, then the P^^^^rug wou W^^^ 
converted to cytotoxic drug directly at the site of the tumor, so that high cytotoxic levels could be main ained at the s te 
while serum levels remained low The unbound catalyst would have low efficiency and. because of its low molecular 
weiaht would clear rapidly from circulation. 

The x-proteins are well suited to applications in immunotargeting because of their binding capabilities and compac^ 
size The absence of constant domains will reduce nonspecific background binding, thus enhancing visuatetion of 
target tissue by Invjyo or invrtro imaging procedures. The use of x-ptoteins in invjtro diagnostics would also reduce 
nonspecific binding thereby increasing the accuracy of these assays. „^in^„ioc 

Thex-proteins of the present invention arealsousefulto deliver drugs, cytotoxicsubstances or effector ^^^^^^ 

to immunotergeted tissues. One of the binding sites of the subject protein may exhibit catalytic activrty that is tnggered 
bv binding of a specific ligand to the other binding site. ...... 

The invention will be further illustrated by the following Examples, which are not intended to be limited in anyway 



Example 1 



Constructing a Model of Bivalent McPCSOSysfv 

The following illustration, which utilizes the McPC 603 Fv region (SEQ ID NOS: 1 and 10). for which the three- 
dimensional crystallographic structure has been determined, has been chosen in order to assess how ^^^P^^^^^' 
set of splice points will generate a x-site (addftional binding site) that recreates the parent binding site^ This example 
also reveals the general method of building a x-protein model, by the specific example o mapping the McPC603 Vh 
CDRs to Vh bottom loop posittons, and the V^ CDRs to V^ bottom loop posrtions. Molecular modeling was pertorrned 
on a Silicon Graphics 4D/70GT Superworkstation usingthe Biosym programs INSIGHT (to visualize models), HOMOL- 
OGY (to assemble spliced CDR/FR sequences), and DISCOVER (to minimize the energy of the HBic«ym^^^^^ 
San Diego. CA). The HOMOLOGY program was used to assemble the structure in five s eps, the DISCOVER 
program v^s u ed to minimize the energy of the resulting model about the splice points. A though, HOMOIjOGY was 
Ssed in this example, this program fe not necessary to buiW the model. Any one of several alternative molecular me- 
chanics proqrams such as AMBER and CHARMM. could be used instead of the DISCOVER program^ 

To cSes of the McPC603 sFv structure were superimposed (Figure 3A and 3B): model A had the binding site 
up and the model B had the bottom loops up, the CDR loops of B superimposed on the lower toops of A, as described 
above. The primed loops (e.g. H2') are those spliced into the x-site. 

1 . Building the H2- loop and the bridge. (Figures 4A and 4B). The A coordinates for the Vh dorriain were used up 
to where H2' from structure B splices into the C-terminus of structure A and the A coordinates of the linker and the 
following V, domain were likewise used. The coordinates of the H2' segment were taken from B. Once coordinates 
had been assigned to the residues flanking the bridge peptide region, the HOMOLOGY loop search algorithm was 
used to find a 5 residue peptide whose flanking 3 alpha carbons overlapped well with the 3 amino acids on either 
ide oHhe gap, and whose confomiation best fit the surface of the Vh domain. The HOMOLOGY Program^auto- 
matically connects the peptide bonds at the splice points to create the new single chain structure called MDL1 It 
Sge erally be necessa^ to rotate side groups in the interface between Vh and H2- out of the way in o^er o 
reduce steric conflicts. One residue in H2' had to be changed because simple rotation couW not eliminate itsster^; 
conflicts- Phe 137 (position H70)-^ Ala. Using the DISCOVER program, the atoms about the splice points were 
moved so as to minimize contributions to the energy of the structure due to improper bond lengths and geometries. 
At this point the composition of bridge peptide was polyalanine; additional computational analysis of the model 
may be used to further assess the optimum length, path and composrtfon of the bridge. This new structure (with 
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H2' and the bridge) is called MDL1 . 

2. Building H3' and H1' Loops. (Figures 4C and 4D) Structure MDL1 supplied the coordinates for the residues 
preceding and following the segment where H3' was to be built into the model. The coordinates for the H3' segment 

s were taken from structure B. Since H1' was a short segment of 3 residues that would replace exactly 3 residues 

of M0L1, HV was built by side chain substitution. This structure is MDL2. 

3. Building L3'. (Figure 4E) Structure MDL2 supplied the coordinates for the residues preceding and following the 
segment where L3' was to be built into the model. The segment for L3' was built into the next stage of the model 

10 in a manner analogous to the way that MDL2 was built. The resulting structure Is MDL3. 

4. Building LI (Figure 4F) Structure MDL3 supplied the coordinates for the residues preceding and following the 
segment where LI ' was to be built into the model. The segment for LV was built into the next stage of the model 
in a manner analogous to the way that MDL2 was built. This structure is MDL4. 

IS 

5. Building L2'. (Figure 4G) Structure MDL4 supplied the coordinates for residues preceding the segment where 
L2' was to be built into the C-terminus of the model. The segment for L2' was built into the next stage of the model 
in a manner similar the way that MDL1 was built except that no connection (bridge or linker) was made to the C- 
terminus of the L2' segment. In order to stabilize the C-temiinus of the L2* loop, a disulfide bridge was constructed 

20 between the C-terminus of L2' and the segment that derived from the Vl N-terminal strand by substituting the Arg 

in L2' (at the position corresponding to L68 In VJ and the Thr at the position corresponding to L5 in Vl with Cys. 

6. Refinement. At each stage, atoms making bad contacts were rotated out of the way. This often required breaking 
the peptide bond between neighboring residues, rotating a backbone bond and then reforming the peptide bond. 

25 The functionality for these modeling operations is built into the INSIGHT program and other similar molecular 

modeling programs. The resulting modified peptide bond was added to the list of splice points. Strain in the splice 
point peptide bonds was minimized by subjecting the residues on either side of the spliced peptide bond to 100 
cycles of steepest descent minimization using the DISCOVER molecular mechanics program. The final structure 
was minimized for 1000 steps of steepest descent minimization without coulumbic interactions, followed by 200 

30 steps steepest descent, and 1000 steps of conjugate gradient minimization with columbic charge interactions. 

Figures 4A-4G show the. final model of the McPC603 (McPC603) Vh(Vh)- Vl(Vl) x-proteIn with corresponding 
pairs of loops highlighted with ribbons. The symmetry between the top and the bottom of the molecule can be seen. 
Each of the 5 pairs of spliced loops are highlighted in separate frames. 

35 A detailed comparison of the conformation of the new sites is presented in Figure 5. The upper view is of a first 

combining site on the top of the molecule while the lower view is the x-s'te built onto the bottom of the molecule. In the 
figure, corresponding regions of the loops are highlighted and side chain heavy atoms are shown. It can be seen that 
during the minimization process, there was some divergence in the conformation of the backbone and of the side 
' chains between these two sites, but there still remains an impressive degree of homology between them. This is a 

40 measure of the congruency of the lower loop stems and the corresponding upper loop stems. The obsen/ed homology 
also suggests that the steric environment surrounding the spliced loops must be similar to that of the parent loops, 
even though it is clear that the loops of the first site are more exposed than are those of the second x-site. 

Example 2 

45 

Defining top and bottom loop svmmetry and choosing splice points. 

Because of the high level of symmetry that exists between the V^ and V^ subunits, CDR loops can be spliced in 
a "homologous" fashion (Vh top loops onto Vh bottom loops, Vh(Vh). and Vl top loops onto Vl bottom loops, Vl(Vl)) 

so or in a "heterologous" fashion (Vl top loops onto the bottom of Vh, Vh(Vl), and Vh top loops onto the bottom of Vl, Vl 
(Vh)). In addition, there are two ways to connect the Vh and V^ regions: 1 ) linker peptide bridging from the G-terminus 
of Vh to the N-terminus of Vl (Vh - linker - Vl), or 2) linker peptide running from the C-terminus of Vl to the N-temiinus 
of Vh (Vl - linker - Vh). There are, therefore, four possible types of x-protein derived from Immunoglobulin CDRs. 
If one is splicing the CDRs of one Vh into the bottom loops of another Vh (as in Example 1) then H2 would be 

55 spliced Into the Vh C-terminal strand. The C-terminal end of the new H2' segment would end up on the outside of Vh 
region, near its N-terminal strand (Figure 3). Thus, if one is splicing heavy chain loops Into the bottom of the heavy 
chain in a sFv connected in the Vh - Vl order, the C-terminal end of H2' can be connected to the peptide that links the 
heavy and light chains. The L2 loop structure can likewise be spliced onto the C-termlnus of Vl- 
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MnHirn-"ii(^V - ^r^j) r-Protein based on McPC603: 

Tha followina discussion presents a detailed analysis of how to construct a second McPC603 antigen binding site 

T^lmsZTT^e bouom Ips of Fv regions will work with any antibody Fv. The existing structural hornotogy 

EzS=:i^— ^^^^ 

scale used to refer to splice points Is defined in Figure 2B. 
Determining loop splice points 

u./h.n tho r no inoDs are soliced into the lower loops, it is important to preserve those framework residues bordering 

of the splice points: 

1. Preservation of critical framework residues. 

2. Maximal Incorporation of CDR sequences. 

3 Fusion at closet possible splice points. In order for the spliced loops to maintain their native structure In the new 
site iTimportan t^t the chains be distorted as little as possible at the splice sites. Therefore one tries to chose 
s nc^ J^iras close as posstole to the positions where the aligned chains are cbsest together, e.g.. where cor- 
responding alpha carbons are separated by no more than a few angstroms. 

n«t»rminina the Pp'-^" P"i"«s for SDli^ -inn H.'^ fH-TL4^ Into H-RI ? and I (I.-TL4) Into L-BL2: 

Figures 6 and 7 show how the CDR 3 regions (H3 and L3) would be spliced into H-BL2 and L-BL2, respectK/ely^ 
CriticlLmevSk riLs H.BL2 and L-BL2 include the Gin at posttion H39 in H-BL2 and the Gin at position L45 ,n 

^'^ Referrinq to Figure 6. residues position H102 to H114 from H3 (positions H101 to H115) can be spHced into H- 
Bl^ T^elVe i^^^^^^^^^ 

Is T^e H3 Se at portion H1 1 5 is not included because splicing from position H11 4 ,n the H3 loop to posrt^n H47 
In the framework leads to less distortion of both the framework and the loop conformations. 

?hLT3T^o Dositions L97 to L104) can be spliced Into L-BL2 In a similar manner. Referring to Figure 7 the new 
LSM^IclSesre^^^^^^^ 

backbone conformatkxi In the x-protein. 

n»terminina the !=^nlir.» Points for S P »i^inn HI fH-TL1^ into H-RI 4, and LI (L-TL1) into L-BL4: 

Finures 8 and 9 show how the CDR1 regions would be spliced into the fourth bottom loops. Because sections of 
both H-BL4 f ndtsurauLain non-critlL residues do not align well wrth the CDRt portion of either CDR loop. 

'-'r^::::^HZS:Z " S residues iong in .cPCe03, and of -e, .e 3 r^^^^^^^^^ 

H31 to H33) that can be splteed are on the outer portion of H-TLl , whk^h faces the binding srta ^92;^^^^^ 
Ts a critical residue that is Involved in a consented Intrachain salt bridge, while pos.tK.ns H98 are conseive^ 

Inne^SLd residues. Consequently posit^ns H31 to H33 are spliced between posrtions H92 and H96 in H.BL4 to 

• °''1n mIpC603 LI (positions L25 to L41) is fairly long (17 residues), but only the internal 10 residues (Position 
,oL39^caSfce^^^^^^^ 

^93 to a^e nfeSinnlr strand residues that are structural^ analogous ^° "^^^^ "^,VrPC603Lu:g^^^ 
framework position L93 the LI loop Is splfced at position L39. Consequent^. 1 0 residues from the McPC603 L1 region 
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can be spliced into L-BL4. In other antibodies having shorter LI regions, only about 3 or 4 residues can be spliced. 
Determinino the Splice Points for Splicing H2 (H-TL2) into the C-terminal end of VH: 

5 Figure 1 0 shows how the H2 region of the inverted copy of McPC603 aligns with the N-terminal and the C-terminal 

ends of Vh- Residues positions H48 to H50 that flank the N-terminal end of H2 loop region (positions H50 to H70) align 
closely with Vh C-terminal residue positions H117 to H119, respectively. When splicing in the H2' loop, one wants to 
preserve as much of the Vh C-terminus as possible. Based on the superposition of the C-terminal and H2' backbones, 
splicing can be done between positions H115 and H119 in the C-terminal strand and between positions H47 and H51 

10 . in N-terminal flanking sequence of H2. Because several residues in the C-terminal sequence of Vh (and Vl as well) 
are probably important to stabilizing the packing of BL1 and BL4, the splice should be made so as to preserve as much 
of the C-terminus as possible. As shown in Figure 10, therefore, the splice is made between position H119 of the Vh 
C-terminal strand and position H51 of the H2' loop. The C-terminal end of H2' (arrow A, Figure 10) is located on the 
surface of Vh, near the N-terminal strand of Vh, when H2' is properly positioned relative to H3', L3', HI' and LV. To 

IS connect this end of the N-terminal end of the Interchain linker used to create an single chain Fv (arrow B, Figure 10), 
a 5 residue bridge peptide must be added (between points B and A in Figure 1 0). In this arrangement, the linker-bridge 
structure folds across the bottom of the Vh chain. The preferred bridge peptide would contain Ser for solubility and 
non-polar residues to help anchor it to the surface of Vh so as to stabilize the conformation of the H2' fold. In the model 
and in the sequence shown in Figure 1 2, a polyalanine bridge has been used. 

20 Figure 1 1 shows how the region of the inverted copy of McPC603 aligns with the N-terminal and C-terminal ends 

of Vl. Residue positions L49 to L57 that flank the L2 loop region (positions L57 to L63) align closely with Vl C-terminal 
residue positions L102 to LI 09. The same considerations are involved in making the L2' splice as with the H2' splice. 
The splice is made from C-terminal position L108 to L2 position L56. The L2 loop structure folds around until it runs 
parallel to the Vl N-terminal strand. In order to stabilize the L2' loop structure, we include a disulfide bond from the 

2$ end of the L2' loop to a residue in the N-terminal strand of Vl- To accomplish this the Arg at position L68 in the C- 
terminal end of the L2 loop and the Thr at position L5 in the N-terminal strand of VL'were changed to Cys. 

Figure 1 2 shows the sequence (SEQ I D NO: 1 7) of the resulting construct with the sequences of the primary CDR 
sites (bold) and secondary CDR sites (Underlined) identified. 

30 Example 3 

Determining y-Site Splice Points From Primarv Sequence Alignment 

The members of the Ig Superfamily share a high degree of structural homology. By using the homology that exists 

35 between members of known structure,- it is possible to engineer x-proteins from Ig Superfamily molecules whose se- 
quence is known but whose tertiary structure is not yet determined by crystallographic or NMR methods. 

Figure 13A (SEQ ID NOS: 10-16 and 18-30) and 13B (SEQ ID NOS: 1-9 and 31-40) shows the set of sequences 
of Fv domains for which structures are known and are available from the Brookhaven Protein Databank (PDB at 
Brookhaven National Laboratory). 

40 Table 1 lists the sequence and structure identification names, along with the Kabat et^aL classification and refer- 

ences for the sequences that appear in Figures 2B (SEQ ID NOS:'l-16) and 13A (SEQ ID NOS: 10-16 and 18-30) and 
1 3B (SEQ ID NOS: 1 -9 and 31 -40). The alignment of these sequences is based on the structural homology that exists 
between them, especially in the p-strand regions (OS and IS). Because of this high degree of structural correlation, 
one can assign positions in a "consensus 3-D model" to residues in a set of aligned p-barrel sequences. Furthermore, 

45 at certain positions, there exists a strong preference for certain residues or for residues having certain physical char- 
acteristics (Table 2). Thus, while the 3-D structures of most Fv sequences remain undetermined experimentally the 
residue preferences at these positions makes it possible to reliably align these sequences with those Vor Ig Superfamily 
sequences for which the 3-D structures have been experimentally determined. Ig V regions can thereby be aligned 
with the sequences of more distant members of the immunoglobulin superfamily such as the T-cell receptor which 

so was successfully modeled according to the McPC603 Fv structure (Novotny J. et al. Proc. Natl. A cad. Sci USA 88: 
8646-8650(1991). 
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Once a sequence has been positioned in relation to the set of structurally aligned V region sequences, one can 
use the splice point analysis developed in Exannple 2 to construct a model and delineate a sequence for a x-protein 
by analogy. The procedure for determining proper slice points is as follows: 

30 

1 . Sequences of undetermined 3-D structure are lined up with the set of structurally aligned sequences. The se- 
quence alignment process makes use of the conserved and semi-conserved residue positions listed In Table 1 to 
fix the alignment in certain regions. Between these anchor points it may be necessary to introduce gaps. Sequences 
in the p-strand regions (OS and IS) generally line up without the need for introducing gaps except in the N-terminal 

35 section of the light chain variable region (L OS1 - L BL1). Relative to all other light chain V region classes, kappa 

light chain class V sequences have a deletion at position L7 and an insertion at position L17. Lambda light chain 
sequences also have a deletion at position L7, but no insertion at position L17. Kappa light chain class VII se- 
quences have a deletion at position L23. Other than these gaps in the N-terminal section of the light chains, the 
alignment of sequences is may make use of conserved positions to establish anchor points. Further help In the 

40 alignment of inten/ening sequences is derived from consen^atton of sidechain properties (i.e., polar, non-polar, 

charged, acid, basic, etc) at certain positions. The other major gaps will exist in the CDR loop regions (HI . LI , H2, 
L2, H3 and L3). Because CDR loops are variable in sequence and length, the exact location of gaps in these 
regions is less critical as the sequences in these regions show little homology Structural variation tends to be the 
largest at the center of CDR loops of similar sequence, so gaps are preferably positioned in the center of CDR top 

45 loops. 

2. Once the sequences of one or more V regions of undetermined structure have been lined up with the structurally 
aligned reference set, structure positions are assignable to the residues that locate in the regions of the inner and 
outer p-strands. Because the x-site splice points occur near the ends of the p-strand regions, they can be assigned 

so by analogy with the splice positions determined for McPC 603 in Example 2. Given two sequences aligned with 

the reference set, the primary sequence is that into whose bottom loops the x-site will be built, and Is referred to 
below as the fargeJ sequence; while the secondary sequence contains the CDR loops which will be built into the 
X'S\le, and which will be referred to below as the source sequence, x-site loops are created by splicing top loop 
segments from the source sequence in place of bottom loop segments in the target sequence. For a VH(V^^)-VL 
(Vl) x-protein construct similar to that designed in Example 2, the spliceable portions of source CDRs, flanked by 
the fixed top loop p-strand end points, are labelled In Figure 13 as L1'(S), L2'(S), L3(Sj, H1'(S) and H3'(S). while 
the target bottom loop segments, flanked by the fixed bottom loop p-strand end points, are labelled in Figure 13 
as LI '(T) . L2'(T), L3'(T), H 1 '(T), H2'(T) and H3'(T). Thus, to create a x-site L3' loop in the L BL2 loop of the target 
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sequence one would splice the residues identified as L3'(S) in Figure 13 to the residues flanking the segment 
identified as L3'(T); similarly, to create a x-srte HI' loop in the H BL4 loop of the target sequence, one would splice 
the residues identified as H3'(S) in Figure 13 to the residues flanking the segment identified as Hr(T). and so 
forth These splices are done to create a Vh(Vh).Vl(Vl) X'Protein as in Example 2. To create a Vh(Vl)-Vl(Vh) X" 
Protein L3' loops in the H BL2 loop of the target sequence, one would splice the residues in the segment identified 
as L3'(S) in Figure 13 to the residues flanking the segment identified as H3'(T). Likewise, to create a x-site H1| 
loop in the L BL4 loop of the target sequence, one would splice the residues flanking the region Identified as HI' 
(S) in Figure 13 to the residues flanking the segment identified as L1'(T), and so forth. 

3 Alternatively, one can build structural models for the sequences of undetermined structure using the coordinates 
of one or more of the known structures as a basis for the framework residue locations. Programs such as the 
Biosym HOMOLOGY (Biosym. Inc., San Diego, CA) program are designed to aid In this process. The procedure 
is well established in the literature (Grear. J. n991^ Meth. in Enzymol. 202:239-252 (1991)). Once constructed, 
models of the parent Fv. sFv or V region and of the Fv regions of the corresponding x-site can thus be used to 
determine splice points and to construct a model of the x-protein in the manner set forth in Examples 1 and 2. 
Building a model provides insight into possible steric conflicts, particularly, as was noted in Example 2. at the 
interface between H2' and the surface residues of the Fv domain. 

Figure 14 shows the sequence of x-protein R19.1 (D1.3) (SEQ ID NO: 42) as it would be constructed following 
parts 1 and 2 of the example using the aligned sequences in Figure 14. The framework and primary GDR loops are 
taken from R19.9 while the sequences for the x-site loops were taken from the HV(S). H2'(S), H3'(S), Lr(S), and L3' 

(S) regions of D1.3. , . 

Figure 1 4 also shows the sequences of x-protein MCP(MGP) (McPC603(McPG603) (SEQ ID NO: 41 )) as it would 
be constructed in Examples 1 and 2. Also shown are sequences of x-proteins 26-1 0(D1 .3) (SEQ I D NO: 43) and 26-1 0 
(GL00P4) (SEQ ID NO: 44) as they would be constructed using the method of Example 3. 
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TABLE 2 (continued) 



1 Light Chain Position 


Residue Preference 


Heavy Chain Position 


Residue Preference 
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TABLE 2 (continued) 





Light Chain Position 


Residue Preference 


Heavy Chain Position 


Residue Preference 
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Notes: 1) Single letter amino acid codes; letters in parentheses signify less common 
preferences. 

2) signifies the occurrence of a gap. 
1 3) Each row corresponds to an equivalent location in the Vh and Vl chains. | 



EXAMPLE 4 
so 

Construction. Expression and Evaluation of a r-Protein 

The x-1 and x-2 genes (Figure 15A and B, SEQ ID NOS: 45 and 47) were prepared by mutagenesis of the 26-10 
sFv gene as described in (Huston, J.S., et al., Proc. Natl. Acad. Sci. USA . 85:5879-5883 (1988); Tai, M.-S., et al.. 
Biochemistry 29:8024-8030 (1 990)) and were incorporated into the pET vector described in Studler, F. W., and Moffat, 
B. A. . J. Mol. Biol. 1 89: 1 1 3 (1 986) behind a T7 promoter. Upon transformation of E. coli with this vector, direct expression 
produced each in the form of cytoplasmic inclusion bodies. Cells were treated with lysozyme to allow cell lysis and 
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ultracentrifugal Isolation of the inclusion bodies, which were then dissolved in 6 M guanldinium chloride containing 10 
mM dithlothreitol, 25 mM Tris. and 1 0 mM EDTA at pH 8. 1 ; the solution was incubated overnight at room temperature. 
The protein was then diluted into 3 M or 3.5 M urea buffer containing 25 mM Trie, and 10 mM EDTA, and a glutathione 
redox couple (1 mM oxidized. 0.1 mM reduced) at pH 8.1. Following 18 h. at 4^C, each was fully renatured by dialysis 

s into phosphate buffered saline (PBS), consisting of 0.05 M potassium phosphate, 0.15 M NaCI. pH 7.0, and 0.03% 
NaNa. Thex-1 and x-2 solutions were then passed through ouabain Sepharose columns, washed first with PBSA (PBS 
+0.03% NaNg), then with 1M NaCI in PBSA to remove any unbound material from the columns, and elutlon effected 
by displacing specifically bound protein with 20 mM ouabain in PBSA. The affinity purified x-1 and x-2 proteins were 
then examined by SDS polyacrylamide gel electrophoresis. Figure 1 6A and B shows the x-1 (SEQ ID NO: 46) and %- 

10 2 (SEQ ID NO: 48) polypeptide chains following their affinity isolation. In figure 16B, the oxidized x-2 (upper band) is 
compared with 26-10 sFv (lower band) in lanes denoted as mixture. Sequence analysis of the x-2 protein verified that 
the insertions noted in the gene sequence (Figure 15B) were present in the protein sequence CNBr fragments of the 
protein were made and sequenced as a mixture in an Applied Biosystems 470A gas-phase sequencer equipped with 
a model 120A on-line analyzer. The gel (Figure 16B) is also consistent with the increased molecular weight of the x- 

15 2 over the 26-10 sFv. Refolded protein that bound to the ouabain-Sepharose column has necessarily regained active 
antibody combining sites for digoxin-like cardiac glycosides typified by ouabain. 

Additional insights into the shape and properties of the x-1 and x-2 proteins are apparent from Superdex 75 size 
exclusion chromatography of the affinity purified % proteins in comparison to the 26-1 0 sFv, as shown in Figure 1 7. The 
single HI' insertion of x-1 adds two tyrosyl residues within the GYGY sequence, resulting in a pronounced tendency 

20 to dimerize and a very skewed profile indicative of dissociation into monomer (data not shown). In contrast, data shown 
in Figure 17 for x-2 (top panel) indicate that it appears perfectly behaved in solution, devoid of any apparent dimer. 
Th us, as one incorporates a multiplicity of x-CDRs in V regions, the known hydrophoblcity of CDR sequences apparently 
is contained by the aggregate of x-CDR conformation and interaction. The added HI 'and H2' loops necessarily increase 
the protein's Stokes radius, resulting in its elution position being between the 26-10 monomer and dimer positions 

25 (bottom panel). A mixing experiment that combines both proteins in a single chromatographic separation (middle panel) 
indicates that the x-2 shows no apparent interaction with the 26-10 monomer and dimer species, as the middle profile 
appears to be a simple additive composite of the top and bottom chromatograms. The peaks beyond 30 minutes 
(horizontal axis is in minutes) are simply injection artifacts on the HPLC system, probably due to buffer differences 
between the column and sample. 

30 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many 
equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be en- 
35 compassed by the following claims. 

Claims 

40 1. A chimeric multivalent immunoglobulin (Ig) Superfamily protein analogue consisting essentially of a variable frag- 
ment (Fv) of an Ig Superfamily protein region having two ligand binding sites comprising one or more polypeptide 
' chains forming a p-barrel domain containing complementarity-determining region-like (CDR-like) regions and 
framework region-like (FR-like) regions, said CDR-like regions defining a first ligand binding site and said protein 
analogue having a second ligand binding site segment spliced into the bottom FR-like regions of said p-barrel 

45 domain, and optionally said polypeptide chains have an amino acid sequence wherein said sequence is substituted 

or modified in the amino ac\d sequence of at least one amino acid residue. 

2. A chimeric multivalent Ig Superfamily protein analogue of Claim 1 wherein (i) a non-covalently associated two 
chain polypeptide forms a p-barrel domain; or (ii) a single chain polypeptide forms a p-barrel domain; or (ill) com- 

50 prising a single chain polypeptide forming a p-barrel domain wherein said single chain polypeptide is comprised 

of two polypeptide chains connected by a polypeptide linker spanning the distance between the C-terminus of one 
chain to the N-termlnus of the other chain; or (iv) the polypeptide chain is selected from the group consisting of: 
heavy chain (H), light chain (L), a chain (a) p chain (p), y chain (y), 5 chain (5), or e chain (e); and optionally the 
protein being cross linked at sites other than ligand binding sites to form a two dimensional array of chimeric 

55 multivalent protein analogues. 

3. Biological material having a nucleotide sequence which encodes a chimeric multivalent Ig Superfamily protein 
analogue of Claim 1, or a replicable recombinant DNA expression vector containing the nucleotide sequence. 
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4. A chimeric multivalent antibody analogue consisting essentially of variable fragment (Fv) region of an antibody 
having two ligand binding sites comprising one or more polypeptide chains forming a p-barrel domain containing 
complementarity determining regions (CDRs) and framework regions (FRs), said CDRs defining a first antigen 
binding site, said antibody analogue having a second antigen binding site segment spliced into the bottom FRs of 
said p-barrel domain, and optionally either: 

(a) a non-covalently associated two chain polypeptide forms a p-barrel domain; or 

(b) a single chain polypeptide forms a p-barrel domain; or 

(c) said CDRs and FRs are comprised of heavy chain (H) polypeptide chains and light chain (L) polypeptide 
chains derived from variable regions (V) of immunoglobulin proteins, and optionally either (i) the CDRs may 
be spliced into FRs of the p-barrel domain to form an additional binding site segment such that a variable 
heavy chain (Vh) CDR is spliced into a FR to form a Vh{Vh) polypeptide chain, or (ii) the CDRs may be 
spliced into FRs of the p-barrel domain to form an additional binding site segment such that a variable light 
chain (VJ CDR is spliced into a Vl FR to form a VJVJ polypeptide chain, or (iii) the CDRs may be spliced 
into FRs of the p-barrel domain to form an additional binding site segment such that a Vh CDR is spliced into 
a Vl FR to form a Vl(Vh) polypeptide chain, or (iv) the CDRs may be spliced into FRs of the p-barrel domain 
to form an additional binding site segment such that a Vl CDR is spliced into a Vh FR to form a Vh(Vl) polypep- 
tide chain; or 

(d) a single chain polypeptide forming a p-barrel domain wherein said single chain polypeptide is comprised 
of two polypeptide chains connected by a polypeptide linker spanning the distance between the C-terminus 
of one chain to the N-terminus of the other chain, and optionally said two polypeptide chains connected by a 
linker further comprise two Vh(Vh), Vl(Vl), Vh(Vl) or Vl(Vh) polypeptide chains, and to the N-terminal end of 
the polypeptide linker spanning the distance between the C-terminus of one polypeptide chain to the N-termi- 
nus of the other polypeptide chain may be added a polypeptide residue bridge which connects the N-terminal 
end of the linker to the C-terminal end of a CDR sequence which has been added to the C-terminal end of a 
FR sequence, which may comprise at least 19 amino acid residues; or 

e) said CDRs and FRs are of mammalian origin, for example said CDRs and FRs may be of mouse myeloma 
origin. 

5. Biological material having a DNA sequence which encodes the chimeric multivalent antibody analogue of Claim 4. 

6. A replicable recombinant DNA expression vector containing the DNA sequence of Claim 5. 

7. A chimeric multivalent Ig Superfamily protein analogue of Claim 1 wherein 

(a) one binding site Is reactive with a diagnostic imaging agent; or 

(b) one binding site is reactive with a radiosotope; or 

(c) one binding site is reactive with a cytotoxic substance; or 

(d) one binding site is reactive with an effector molecule; or 

(e) one binding site Is reactive with a marker on a cytotoxic cell. 

8. A chimeric multivalent Ig Superfamily protein analogue of Claim 1 for use in 

(i) imaging specific tissue in a host comprising: 

(a) administering to a host the chimeric multivalent Ig Superfamily protein analogue having one binding 
site reactive with a targeted tissue specific antigen and the other binding site reactive with a diagnostic 
imaging agent under conditions wherein said protein analogue binds to the targeted tissue; and 

(b) administering the imaging agent to the host under conditions whereby said imaging agent binds to the 
chimeric multivalent Ig Superfamily protein analogue resulting in a detectable image of the targeted tissue; 
or 

(ii) irradiating specific tissue in a host comprising: 

(a) administering to a host the chimeric multivalent Ig Superfamily protein analogue having one binding 
site reactive with a targeted tissue specific antigen and the other binding site reactive with a radiosotope 
under conditions whereby said protein analogue binds to the targeted tissue; and 

(b) administering the radioisotope to the host under conditions whereby said radioisotope binds to the 



22 



EP 0 640 130 B1 



chimeric multivalent Ig Supertamily protein analogue wherein binding of said chimeric protein analogue 
to targeted tissue and binding of said radioisotope to said chimeric protein analogue results in irradiation 
of the targeted tissue; or 

s (lil) delivering a cytotoxic substance to specific tissue in a host comprising: 

(a) administering to a host the chimeric multivalent Ig Superfamily protein analogue having one binding 
site reactive with a targeted tissue specific antigen and the other binding site reactive with a cytotoxic 
substance under conditions whereby said protein analogue binds to the targeted tissue; and 
10 (b) administering the cytotoxic substance to the host under conditions whereby said cytotoxic substance 

binds to the chimeric multivalent Ig Superfamily protein analogue wherein binding of said chimeric protein 
analogue and binding of said cytotoxic substance to said chimeric protein analogue results in delivering 
the toxic substance to the targeted tissue; or 

IS (iv) lysing target cells in a host having cytotoxic cells comprising administering to a host the chimeric multivalent 

Ig Superfamily protein analogue having one binding site reactive with a surface receptor of a cell targeted to 
be lysed, and the other binding site reactive with a marker on a cytotoxic cell under conditions whereby said 
protein analogue binds to the targeted cell and said cytotoxic cell binds to the chimeric multivalent Ig Super- 
family protein analogue, wherein binding of said chimeric protein analogue and binding of said cytotoxic cell 
* 20 results in lysis of the targeted cell; or 

(v) modifying the function of a cell surface receptor of specific tissue in a host comprising administering to a 
host a chimeric multivalent Ig Superfamily protein analogue having one binding site reactive with a targeted 
cell surface receptor and the other binding site reactive with an effector molecule under conditions whereby 
said protein analogue binds to the targeted tissue and said effector molecule binds to the chimeric multivalent 
25 Ig Superfamily protein analogue, wherein binding of said chimeric protein analogue and binding of said effector 

molecule results in selective modification of the function of the targeted cell surface receptor. 

9. The chimeric multivalent Ig Superfamily protein analogue of Claim 1 having one binding site reactive with a prese- 
lected ligand and the other binding site reactive with a substance labeled with a radioisotope or enzyme suitable 

30 for use as a quantifying agent In an In vitro diagnostic assay. 

10. A method for producing a chimeric multivalent Ig Superfamily protein analogue consisting essentially of a variable 
fragment (Fv) of an Ig Superfamily protein region having two ligand binding sites comprising the steps of: 

35 (a) determining the splice points for CDR-like regions to form additional ligand binding site segments on the 

bottom FR-like regions of a p-barrel domain whereby Insertion of CDR-like region amino acid residues into 
the FR-like region residues maintains the folded structure required for binding activity with a preselected ligand; 
(b) determining the amino acid sequence of the resulting construct having a first ligand binding site and a 
second ligand binding site; 

40 (c) deducing the DNA sequence encoding the amino acid sequence of (b); 

(d) synthesizing the DNA sequence; 

(e) inserting the DNA sequence into an appropriate expression vector and expressing the polypeptide in a 
suitable host system; 

(f ) Isolating and purifying the expressed polypeptide; and 

45 (g) refolding the purified polypeptide to its immunologically reactive conformation, thereby resulting in a chi- 

meric multivalent Ig Superfamily protein analogue; and optionally 

wherein determining the splice points is accomplished computationally by use of a computer-generated three 
dimensional structure of the chimeric multivalent Ig Superfarhily protein analogue, or wherein determining the 
so splice points is accomplished by primary sequence alignment. 

11 . A method for effecting cell-cell interactions comprising administering to a host a chimeric multivalent I g Superfamily 
protein analogue of Claim 1 having one binding site reactive with a targeted cell surface receptor of a first cell and 
the other binding site reactive with a targeted cell surface receptor of a second cell under conditions whereby said 

55 protein analogue binds to said first cell and second cell wherein binding of said first cell and second cell to the 

chimeric protein analogue results In interaction between the two cells. 

12. A molecular switch comprising a chimeric multivalent Ig Superfamily protein analogue of Claim 1 having one binding 
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site initiating a conformational change in said chimeric protein anabgue when said binding site is bound to ligand, 
whereby the conformational change causes the chimeric protein analogue to act as a molecular switch. 

13. A chimeric multivalent Ig Superfamlly protein analogue according to Claim 1 for use In therapy or diagnosis, for 
s example (a) imaging specific tissue in a host; or (b) irradiating specific tissue in a host; or (c) delivering a cytotoxic 

substance to specific tissue in a host; or (d) lysing target cells in a host; or (e) modifying the function of a cell 
surface receptor of specific tissue in a host; or (f) effecting cell-cell interactions In a host. 

14. Use of a chimeric multivalent Ig Superfamlly protein analogue according to Claim 1 for the manufacture of a diag- 
10 nostic agent for Imaging specific tissue in a host; or for the manufacture of a medicament for (a) irradiating specific 

tissue in a host; or (b) delivering a cytotoxic substance to specific tissue in a host; or (c) lysing target cells in a 
host; or (d) modifying the function of a cell surface receptor of specific tissue in a host; or (e) effecting cell-cell 
interactions in a host. 

75 

PatentansprQche 

1. Ein chlmares mehrwertlges Proteln-Analogon der Immunoglobulin (Ig) Superfamllle, das im wesenlichen aus ei- 
nem variablen Fragment (Fv) einer Proteinregion der Ig Superfamilie besteht, zwel LIgandbindungsstellen besitzt, 

20 eine Oder mehrere Peptidketten umfaQt, die eine p-FaB Domane bilden, die komplementaritatsbeatimmende Re- 
gion-ahnliche (CDR-ahnliche) Regbnen und Gerustregbn-ahnllche (FR-ahnllche) Regionen enthalt, wobei be- 
sagte CDR-ahnllche Regionen eine erste Ligandbindungsstelle deflnleren und besagtes Protein-Anatogon ein in 
die FR-ahnllchen Boden -Regionen von besagter p-RaB Domane gesplelBtes zweites Ligandbindungestellenseg- 
ment besltzt, und gegebenenfalls besagte Polypeptidketten eine Aminosaure-Sequenz besitzen, worin besagte 

25 Sequenz in der Aminosaure-Sequenz mindestens in einem Am inosau re-Rest substituiert Oder modifiziert ist. 

2. Ein chimares mehrwertiges Protein-Analogon der Ig Superfamilie nach Anspruch 1, worin (i) ein nicht-kovalent 
assoziiertes zweikettiges Polypeptid eine p-Fa3 Domane bildet; oder (ii) ein einzelkettiges Polypeptid eine p-FaG 
Domane bildet; oder (iii) umfassend ein einzelkettiges Polypeptid, das eine p-Fa3 Domane bildet, worin besagtes 

30 einzelkettiges Polypeptid aus zwel Polypeptidketten besteht, die Qber einen Polypeptid-Llnker verbunden sind, 
der sich Qber die Distanz zwischen dem C-Terminus von einer Kette bis zum N-Terminus der anderen Kette er- 
streckt; oder (iv) die Polypeptidkette aus der Gruppe gewahit ist bestehend aus: schwerer Kette (H), leichter Kette 
(L), a Kette (a) p Kette (P), y Kette (y), 6 Kette (8) oder e Kette (e) ; und wobei gegebenenfalls das Protein an 
anderen Stellen als LIgandbindungsstellen vernetzt ist, um einen zweidimensionalen Bereich von chimaren mehr- 

35 wertigen Protein-Analoga zu bilden. 

3. Biologisches Material mit einer Nucleotid-Sequenz, die ein chimares mehnwertlges Protein-Analogon der Ig Su- 
perfamllle nach Anspruch 1 codiert, oder ein replizierbarer rekombinanter DNS-Expressionavektor, der die Nu- 
cLeotld-Sequenz enthalt. 

40 

4. Ein chimares mehrwertiges Antlkorper-Analogon, das im wesentllchen aus einer variablen Fragment (Fv) Region 
eines Antikorpers besteht, zwei Ligandbindungsstellen besltzt, eine oder mehrere Polypeptidketten umfaBt, die 
eine P-Fa3 Domane bilden, die komplementarltatsbestimmende Regionen (CDRs) und Gerustregionen (FRs) ent- 
halten, wobei besagte CDRs eine erste Antigenbindungsstelle definiereh, wobei besagtes AntikorperAnalogon ein 

"^5 in die Boden-FRs von besagter P-Fa3 Domane gesplelBtes zweites Antigenbindungsstellensegment besitzt. und 

gegebenenfalls entweder: 

(a) ein nicht-kovalent assoziiertes zweikettiges Polypeptid eine p-FaB Domane bildet; oder 

(b) ein einzelkettiges Polypeptid eine p-FaB Domane bildet; oder 

50 (c) besagte CDRs und FRs schwerkettige (H) Polypeptbketten und leichtkettige (L) Polypeptidketten umfas- 

sen, die von variablen Regionen (V) von Immunoglobulin-Proteinenabstammen, und gegebenenfalls entweder 
(i) konnen die CDRs in FRs der p-FaB Domane gespleiBt sein, um ein zusatzliches Bindungsstellensegment 
zu bilden, so daB eine variable schwere Kette (V^) CDR in eine FR gespleiBt Ist, um eine eine (V^) 
Polypeptidkette zu bilden, oder.(ii) konnen die CDRs in FRs der p-FaB Domane gespleiBt sein, um ein zusatz- 

55 liches.Bindungsstellensegment zu bilden, so daB eine variable leichte Kette (Vh) CDR in eine Vl FR gespleiBt 

ist, um eine Vl (Vl) Polypeptidkette zu bilden, oder (ill) konnen die CDRs in FRs der p-FaB Domane gespleiBt 
sein, um ein zusatzliches Bindungsstellensegment zu bilden, so daB eine V^ CDR in eine Vl FR gespleiBt ist, 
um eine Vl (V^) Polypeptidkette zu bilden, oder (Iv) die CDRs konnen in FRs der p-FaB Domane gespleiBt 
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sein, urn ein zusatzliches Bindungsstellensegment zu bilden, sodaB eine VlCDR in eine Vh FR gesplei3t ist, 
urn eine Vh (VJ Polypeptidkette zu bilden; oder 

(d) ein einzelkettiges Polypeptid, das eine p-Fa3 Domane bildet, worin besagtes einzelkettiges Polypeptid 
zwei Polypeptidketten umfaBt, die uber einen Polypeptid-Linker verbunden sind, der sich uber die Distanz 
zwischen dem C-Terminus von einer Kette zum N-Terminus der anderen Kette erstreckt, und gegebenenfalls 
besagte uber einen Linker verbundene zwei Polypeptidketten umfassen ferner zwei (Vh), VlCVJ, Vh(Vl) 
Oder V|_(Vh) Polypeptidketten, und an dem N-terminalen Ende des Polypeptid-Linkers. der sicli Ober die Di- 
stanz zwischen dem C-Terminus von einer Polypeptidkette zum N-Terminus der anderen Polypeptidkette er- 
streckt, kann eine Polypeptidrest-Brucke angefugt sein, die das N-terminale Ende des Linkers mit dem C- 
terminalen Ende einer CDR-Sequenz verbindet, die an das C-terminale Ende einer FR-Sequenz angefOgt 
worden ist, die mindestens 19 Aminosaure-Reste umfassen kann; oder 

(e) besagte CDRs und FRs sind Sauger-Ursprungs, zum Beispiel konnen besagte CDRs und FRs Maus Mye- 
loma Ursprungs sein. 

IS 5. Biologisches Material mit einer Nucleotid-Sequenz, die ein chimares mehrwertiges Antikorper-Analogon nach An- 
spruch 4 codiert. 

6. Ein replizierbarer rekombinanter DNS-Expresslonsvektor, der die DNS-Sequenz nach Anspruch 5 enthalt. 
20 7. Ein chimares mehnwertiges Protein-Analogon der Ig Superfamilie nach Anspruch 1 , worin 

(a) eine Bindungsstelle mit einem diagnostischen bllderzeugenden Agens reaktiv ist; oder 

(b) eine Bindungsstelle mit einem Radioisotop reaktiv ist; oder 

(c) eine Bindungsstelle mit einer cytotoxischen Substanz reaktiv ist; oder 
25 (d) eine Bindungsstelle mit einem Effektor-Molekul reaktiv ist; oder 

(e) eine Bindungsstelle mit einem Marker auf einer cytotoxischen Zelle reaktiv ist. 

8. Ein chimares mehnwertiges Protein-Analogon der Ig Superfamilie nach Anspruch 1 zur Venwendung fur 
30 (j) eine bildliche Darstellung von spezifischem Gewebe in einem Wirt, die umfa3t: 

(a) Verabreichen des chimaren mehrwertlgen Protein-Analogons der Ig Superfamilie mit einer Bindungs- 
stelle, die mit einem Zielgewebe-spezifischen Antigen reaktiv ist, und der anderen Bindungsstelle, die mit 
einem diagnostischen bilderzeugenden Agens reaktiv ist, an einen Wirt unter Bedingungen, worin besag- 

35 tes Protein-Analogon an das Zielgewebe bindet; und 

(b) Verabreichen des bilderzeugenden Agens an den Wirt unter Bedingungen, wodurch besagtes bilder- 
zeugende Agens an das chimare mehnwertige Protein-Analogon der Ig Superfamilie bindet, was zu einem 
detektierbaren Bild des Zielgewebes fuhrt; oder 

40 (li) Bestrahlen von spezifischem Gewebe in einem Wirt, das umfaBt: 

(a) Verabreichen des chimaren mehrwertigen Protein-Analogone der Ig Superfamilie mit einer Bindungs- 
stelle, die mit einem Zielgewebe-spezifischen Antigen reaktiv ist, und der anderen Bindungsstelle, die mit 
einem Radioisotop reaktiv ist, an einen Wirt unter Bedingungen, wodurch besagtes Protein-Analogon an 
das zielgewebe bindet; und 

(b) Verabreichen des Radioisotops an den Wirt unter Bedingungen, wodurch besagtes Radioisotop an 
das chimare mehnwertige Protein-Analogon der Ig Superfamilie bindet, worin ein Binden besagten chi- 
maren Protein-Analogons an das Zielgewebe und ein Binden besagten radioisotops an besagtes chimares 
Protein-Analogon zur Bestrahlung des Zielgewebes fuhrt; oder . 

(iii) Oberbringen einer cytotoxischen Substanz an ein spezlfisches Gewebe in einem Wirt, das umfaBt: 

(a) Verabreichen des chimaren mehrwertigen Protein-Analogons der Ig Superfamilie mit einer Bindungs- 
stelle, die mit einem Zielgewebespezifischen Antigen reaktiv ist, und der anderen Bindungsstelle, die mit 
einer cytotoxischen Substanz reaktiv ist, an einen Wirt unter Bedingungen, wodurch besagtes Protein- 
Analogon an das Zielgewebe bindet; und 

(b) Verabreichen der cytotoxischen Substanz an den Wirt unter Bedingungen, wodurch besagte cytoto- 
xische Substanz an das chimare mehrwertige Protein-Analogon der Ig Superfamilie bindet, worin ein Bin- 
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den von besagtem Proteln-Analogon und ein Binden von besagter cytotoxischer Substanz an besagtes 
chlmares Protein-Analogon zu einem Oberbringen der toxischen Substanz an das Zielgewebe fuhrt; oder 

(iv) Lysieren von Zielzellen in einem Wirt mit cytotoxischen Zellen, das Verabreichen des chimaren nnehrwer- 
5 tigen Protein-Analogons der Ig Superfamilie nnit einer Bindungsstelle, die mit einem Oberflachenrezeptor einer 

zu lysierenden Zielzelle reaktiv ist, und der anderen Bindungsstelle, die mit einem Marker aut einer cytotoxi- 
schen Zelle reaktiv ist, an einen Wirt unter Bedlngungen umfaBt, wodurch besagtes Protein-Analogon an die 
Ziellzelle bindet und besagte cytotoxische Zelle an das cliimare mehrwertige Protein-Analogon der Ig Super- 
familie bindet, worin ein Binden von besagtem chimaren Protein-Analogon und ein Binden von besagter cy- 
10 totoxischer Zelle zu einer Lyse der Zielzelle fuhrt; oder 

(v) Modifizieren der Funktlon eines Zelloberflachenrezeptors eines spezlfischen Gewebes In einem Wirt, wobel 
das Modifizieren ein Verabreichen eines chimaren mehnn/ertlgen Protein-Analogons der Ig Superfamilie mit 
einer Bindungsstelle, die mit einem Zieizelloberflachenrezeptor reaktiv ist, und der anderen Bindungsstelle, 
die mit einem Effektormolekul reaktiv ist, an einen Wirt unter Bedingungen umfa3t, wodurch besagtes Protein- 

is Analogon an das Zleigewebe bindet und besagtes Effektormolekul an das chimare mehnvertlge protein-Ana- 

logon der Ig Superfamilie bindet, worIn ein Binden von besagtem chimaren Protein-Analogon und ein Binden 
von besagtem Effektormolekul zu einer selektiven Modifikation der Funktion des Zielzelloberflachenrezeptors 
fuhrt. 

20 g. Das chimare mehrwertige Protein-Analogon der Ig Superfamilie nach Anspruch 1 mit einer Bindungsstelle, die mit 
einem vorgewahlten LIganden reaktiv ist, und der anderen Bindungsstelle, die mit einer Substanz reaktiv Ist, die 
mit einem zur Verwendung als ein Mittel zur quantltativen Bestimmung in einem in vitro diagnostlschen Test ge- 
eigneten Radioisotop oder Enzym marklert ist. 

2S 10. Ein Verfahren zur Herstellung eines chimaren mehrwertigen Protein-Analogons der Ig Superfamilie, das im we- 
sentlichen aus einem variablen Fragment (Fv) einer Proteinregion der Ig Superfamilie besteht und zwei Llgand- 
bindungsstellen besitzt, wobei das Verfahren die Schrltte umfaBt: 

(a) Bestimmen der Splei(3-Punkte f Or CDR-ahnliche Regionen, um zusatzliche Ligandbindungsstellensegmen- 
30 te am unteren Ende der FR-ahnlichen Regionen einer p-FaR Domane zu bilden, wodurch eine Insertion von 

CDR-ahnliche Region-Aminosaure-Resten in die FR-ahnliche Region-Reste die gefaltete Struktur aufrecht 
erhalt, die fOr eine Bindungsaktivitat mit einem vorgewahlten Liganden erforderlich Ist; 

(b) Bestimmen der Aminosaure-Sequenz des resultierenden Konstrukts, das eine erste Ligandbindungsstelle 
und eine zweite Ligandbindungsstelle besitzt; 

35 (c) Herieiten der DNS-Sequenz, die die Aminosaure-Sequenz von (b) codiert; 

(d) Synthetisieren der DNS-Sequenz; 

(e) Inserieren der DNS-Sequenz in einen geeigneten Expressionsvektor und Expression des Polypeptide in 
einem geeigneten Wirtssystem; 

(f) Isolleren und Reinlgen des exprimlerten Polypeptide; und 

40 (g) Wiederfalten des gereinigten Potypeptids In seine Immunologisch reaktlve Konformation, dadurch wird ein 

chimares mehrwertiges Protein-Analogon der Ig Superfamilie erhalten; und gegebenenfalls 

worin eine bestimmung der SpleiB-Punkte durch Verwendung einer Computer-generierten dreidimensionalen 
Struktur des chimaren mehrwertigen Protein-Analogons der Ig Superfamilie rechnerisch ausgef Ohrt Ist ader worin 
45 eine bestimmung der SpleiB-Punkte durch ein Alignment der Primarsequenz ausgefuhrt ist. 

11. Ein Verfahren zur Beeinflussung von Zell-Zellwechselwirkungen, das Verabreichen des chimaren mehnwertigen 
Protein-Analogons der Ig Superfamilie nach Anspruch 1 mit einer Bindungsstelle, die mit einem Zieizelloberfla- 
chenrezeptor einer ersten Zelle reaktiv Ist, und der anderen Bindungsstelle, die mit einem Zielzelloberflachenre- 

50 zeptor einer zweiten Zelle reaktiv Ist, an einen Wirt unter Bedingungen umfaBt, wodurch besagtes Protein-Analo- 

gon an besagte erste Zelle und zweite Zelle bindet, worin ein Binden von besagter erster Zelle und zwerter Zelle 
an das chimare Protein-Analogon zu einer Wechselwirkung zwischen den beiden Zellen fuhrt. 

12. Ein moiekularer Schalter, der ein chimares mehrwertiges Protein-Analogon der Ig Superfamilie nach Anspruch 1 
55 mit einer Bindungsstelle umfaBt und eine Konformationsanderung in besagtem chimaren Protein-Analogon initiiert, 

wenn besagte Bindungsstelle an einen Liganden gebunden ist, wodurch die Konformationsanderung das chimare 
Protein-Analogon als einen molekularen Schalter fungieren laBt. 
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13. Ein chimares mehrwertiges Protein-Analogon der Ig Superfamille gemaB Anspruch 1 zur Verwendung In Therapie 
Oder Diagnose, zum Beispiel (a) Bildllchen Darstellung von spezifischem Gewebe in einem Wirt; oder (b) Bestrah- 
len von spezifischem Gewebe In eInem Wirt; Oder (c) Uberbrlngen einer cytotoxischen Substanz an spezlfisches 
Gewebe In einem Wirt; oder (d) Lysieren von Zielzellen in einem Wirt; oder (e) Modifizleren der Funktion eines 

s Zelloberfiachenrezeptors von spezifischem Gewebe in einem Wirt; oder (f) Beeinfiussen von Zell-Zellwechselwir- 

kungen in einem Wirt. 

14. Verwendung eines chimaren mehnwertigen Protein-Analogons der Ig Superfamilie gemaB Anspruch 1 zur Herstel- 
lung eines diagnostischen MIttels zur bildllchen Darstellung von spezifischem Gewebe in einem Wirt; oder zur 

10 Herstellung eines Medikaments zum (a) Bestrahlen von spezifischem Gewebe in einem Wirt; oder (b) Uberbrlngen 

eIner cytotoxischen Substanz an spezlfisches Gewebe In einem Wirt; oder (c) Lysieren von Zielzellen in einem 
Wirt; Oder (d) Modifizleren der Funktion eines Zelloberfiachenrezeptors von spezifischem Gewebe In elhem Wirt; 
Oder (e) Beeinfiussen von Zell-Zellwechselwirkungen in einem Wirt. 

IS 

Revendlcations 

Analogue proteique chimerique multivalent de la Superfamille des immunoglobulines (Ig) consistant essentielle- 
ment en un fragment variable (Fv) d'une region d'une protelne de la Superfamille des Ig ayant deux sites de liaison 
^ un llgand comprenant une ou plusieurs chaTnes polypeptidiques fomnant un domaine formant un tonneau p 
contenant des regions similaires k la region de determination de la complementarity (CDR-like) et des regions 
similaires ^ des regions de squelette (FR-like), lesdites regions CDR-IIke d6finlssant un premier site de liaison k 
un ligand et ledit analogue prot6ique ayant un second segment formant un site de liaison k un llgand 6p\ss6 k 
rinf6rieur des r6glons FR-IIke au fond dudit domaine formant un tonneau p, et 6ventuellement lesdites chaines 
polypeptidiques ayant une s6quence d'acides amines substitute ou modlflde sur au molns un r6sidu d'acldes 
amines. 

Analogue prot6ique chimerique multivalent de la Superfamille des Ig selon la revendication 1, dans lequel (i) un 
polypeptide k deux chaines assoclees de manlere non-covalente forme un domaine en tonneau P; ou (ii) un po- 
lypeptide k simple chaTne forme un domaine en tonneau P; ou (Hi) comprenant un polypeptide a simple chaine 
formant un domaine en tonneau p dans lequel ledit polypeptide k simple chaine est compose de deux chaines 
polypeptidiques Ii6es par un polypeptide linker augmentant la distance entre I'extr6mlt6 C-termlnale d'une chaTne 
et I'extr6mlt6 N-terminale de I'autre chame ; ou (iv) la chaTne polypeptldique est choisie parmi le groupe consistant 
en une chaine lourde (H), une chaine 16g6re (L). une chaTne a (a), une chaTne p (p), une chaTne 7(7), une chaTne 
5 (5) ou une chaTne e (e) ; et eventuellement la protelne 6tant llee de manl&re crois6e k des sites autres que les 
sites de liaison au llgand pour former un r6seau k deux dimensions d'anatogues pr6t6lques chim6riques multiva- 
lents. 

3. Materiel biologique ayant une sequence nucleotldique qui code pour un analogue proteique multivalent chimerique 
40 de la Superfamille des Ig selon la revendication 1 , ou un vecteur d'expression d'ADN recombinant r6pllcable con- 
tenant la sequence nucleotldique. 

4. Analogue d'anticorps multivalent chimerique consistant essentiellement en une region de fragment variable (Fv) 
d'un antlcorps ayant deux sites de liaison k un ligand comprenant une ou plusieurs chaTnes polypeptidiques fomnant 

45 un domaine formant un tonneau p contenant des regions de ddtermination de la compl6mentarlt6 (CDR) et des 

regions de squelette (FR), lesdites CDR definissant un premier site de liaison k I'antlgfene, ledit analogue de I'an- 
ticorps ayant un second segment formant un site de liaison a I'antigene episst k rinttrieurdes regions FR au fond 
dudit domaine formant un tonneau p, analogue dans lequel 6ventuellement soit : 

so (a) un polypeptide k deux chaTnes assocl§es de mani6re non covalente forme un domaine formant un tonneau 

P; ou 

(b) un polypeptide k simple chaTne forme un domaine formant un tonneau p ; ou 

(c) lesdites regions CDR et FR sont compos6es des chaTnes polypeptidiques de chaTnes lourdes (H) et des 
ChaTnes polypeptidiques de chaTnes Itg^res (L) d6rivees des regions variables (V) des proteines d'immuno- 

55 globullnes, et Eventuellement soit (i) les CDR peuvent §tre epissEes k I'interieur des regions FR du domaine 

formant un tonneau p pou r former un segment formant un site de liaison suppI6mentaire de sorte qu'une region 
CDR de la chaTne lourde variable (Vh) soit 6piss6e k I'int6rieur FR de pour former une chaTne polypeptldique 
Vh(Vh), ou (II) les regions CDR peuvent §tre §piss6es k I'int6rleur des regions FR du domaine formant un 
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tonneau ^ pour former un site de liaison supplementaire de sorte qu'une region CDR d'une chains legere 
variable (VJ est 6pissee k I'lnterieur d'une region FR de Vl pour former une chaine polypeptidique Vl(Vl), 
ou (iii) lea regions CDR peuvent etre 6piss§es k Tint^rieur des regions FR du domaine formant un tonneau p 
pour former un segment formant un site de liaison supplementaire tel qu'une region CDR de soit epissee 
k l'int§rieur d'une region FR de Vl pour former une chame polypeptidique VJVh); ou (iv) fes regions CDR 
peuvent etre epissees a I'interieur des regions FR du domaine formant un tonneau jj pour former un segment 
formant un site de liaison supplementaire de sorte qu'une region CDR de Vl soit 6piss6e k TintSrieur d'une 
region FR de V|_| pour former une cliaTne polypeptidique Vh(Vl) ; ou 

(d) un polypeptide k simple chaine forme un domaine formant un tonneau p, ledit polypeptide k simple chaine 
etant compose de deux chaines polypeptidtques liees par un polypeptide linker augmentant la distance entre 
rextr6mit6 C-terminale d'une chaine et I'extr6mit6 N-terminale de I'autre chaine, et 6ventuellement lesdites 
chaTnes polypeptidlques liees par un linker comprennent en outre deux chaTnes polypeptidlques Vh(Vh), Vl 
(Vl). Vh(Vl) ou Vl(Vh). et, k I'extremite N-terminale du polypeptide linker augmentant la distance entre I'ex- 
tremlte C-terminale d'une chaine polypeptidique et I'extremite N-terminale de I'autre chaine polypeptidique 
peut dtre ajoutd un pont form6 de r6sidus polypeptidlques qui lie I'extr6mit6 N-terminale du linker k rextr6mit6 
C-termlnale d'une sequence CDR qui a 6t6 ajout6e k I'extr6mit6 C-terminale d'une s6quence FR, qui peut 
comprendre au moins 19 residus d'acides amines ; ou 

(e) lesdites regions CDR et FR ont une origine mammiffere, par exemple lesdites regions CDR et FR peuvent 
avoir comme origine un myelome de souris. 

Materiel biologique ayant une sequence d'ADN qui code pour un analogue d'anticorps chim^rique multivalent 
selon la revendication 4, 

Vecteur d'expression d'ADN recombinant r6p!lcable contenant la sequence d'ADN selon la revendication 5 
Analogue prot§ique chim^rique multivalent de la Superfamllle des ig selon ta revendication 1 dans lequel 

(a) un site de liaison est capable de r6agir avec un agent d'imagerle diagnostique; ou 

(b) un site de liaison est capable de reagir avec un radioisotope ; ou 

(c) un site de liaison est capable de reagir avec une substance cytotoxique ; ou 

(d) un site de liaison est capable de r§agir avec une molecule effectrtee; ou 

(e) un site de liaison est capable de reagir avec un marqueur d'une cellule cytotoxique. 

Analogue protelque chimerlque multivalent de la Superfamllle des Ig selon la revendication 1 pour I'utilisation dans 

(i) rimagerie d'un tissu sp^cifique chez un hdte comprenant 

(a) I'administration a un hote d'un analogue proteique chimerlque multivalent de la Superfamille des Ig 
ayant un site de liaison pouvant reagir avec un antigene specifique du tissu cible et I'autre site de liaison 
pouvant r6agir avec un agent d'imagerle diagnostique dans des conditions selon lesquelles ledit analogue 
protelque se lie audit tissu cible ; et 

(b) I'administration de I'agent d'imagerle k I'hote dans des conditions selon lesquelles ledit agent d'imagerle 
se lie a I'analogue proteique chimerlque multivalent de la Superfamille des Ig, resultant en une image 
detectable du tissu clbl6 ; ou 

(ii) I'irradiation d'un tissu specifique chez un hdte comprenant : 

(a) I'administration k un hote d'un analogue proteique chimerlque multivalent de la Superfamille des Ig 
ayant un site de liaison pouvant reagir avec un antigene specifique du tissu cibie et I'autre site de liaison 
pouvant reagir avec un radioisotope dans des conditions selon lesquelles ledit analogue proteique se lie 
audit tissu cible ; et 

(b) I'administration du radioisotope k I'h6te dans des conditions selon lesquelles ledit radioisotope se lie 
k I'analogue proteique chimerlque multivalent de la Superfamllle des Ig, la liaison dudit analogue proteique 
chimerlque au tissu cibie et la liaison dudit radioisotope audit analogue proteique chimerlque, resultant 
dans I'irradiation du tissu cibie ; ou 

(iii) la liberation d'une substance cytotoxique k un tissu specifique chez un h6te comprenant : 
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(a) radministration ^ un h6te d'un analogue prot6ique chim^rique multivalent de la Superfamille des Ig 
ayant un site de liaison pouvant r6aglr avec un antigens specifique du tissu clbl6 et I'autre site de liaison 
pouvant r6agir avec une substance cytotoxique dans des conditions selon lesquelles ledit analogue pro- 
t^ique se lie audit tissu cible ; et 
5 (b) radministration de la substance cytotoxique k I'hote dans des conditions selon lesquelles ladite subs- 

tance cytotoxique se lie k I'analogue prot6ique chimerique multivalent de la Superfamille des Ig. la liaison 
de ladite substance cytotoxique k ladite protdine cliim6rique au tissu cibl6 et la liaison de ladite substance 
cytotoxique audit analogue protelque chimerique, resultant dans la liberation de la substance cytotoxique 
au tissu cible ; ou 

10 

(iv) la lyse des cellules cibles chez un h6te ayant des cellules cylotoxiques comprenant radministration k un 
hote de I'analogue proteique chimerique multivalent de la Superfamille des Ig ayant un site de liaison capable 
de r6agir avec un r6cepteur de surface d'une cellule cible k lyser, et I'autre site de liaison capable de reagir 
avec un marqueur sur une cellule cytotoxique dans des conditions selon lesquelles ledit analogue proteique 

IS se lie k la cellule cible et ladite cellule cytotoxique se lie audit analogue proteique chimerique multivalent de 

la Superfamille des Ig, la liaison dudit analogue proteique chimerique et la liaison da ladite cellule cytotoxique 
resultant dans la lyse de la cellule cibiee ; ou 

(v) la modification de la fonction d'un recepteur de surface celiulaire d'un tissu specifique chez un hote com- 
prenant radministration a un hote d'un analogue proteique chimerique multivalent de la Superfamille des Ig 

20 ayant un site de liaison capable de reagir avec un recepteur de surface celiulaire cible et I'autre site de liaison 

capable de reagir avec une molecule effectrice dans des conditions selon lesquelles ledit analogue proteique 
se lie audit tissu cible et ladite molecule effectrice se lie a I'analogue proteique chimerique multivalent de la 
Superfamille des Ig, la liaison dudit analogue proteique chimenque et la liaison de ladite molecule effectrice 
resultant dans la modification selective de la fonction du recepteur de surface celiulaire cible. 

25 . ' 

9. Analogue proteique chimerique multivalent de la Superfamille des Ig selon la revendication 1 ayant un site de 
liaison capable de reagir avec un ligand preselectionne et I'autre site de liaison capable de reagir avec une subs- 
tance marquee par un radioisotope ou une enzyme appropriee pour une utilisation en tant qu'agent de quantifica- 
tion dans un test de diagnostic in vitro. 

30 

10. Precede de production d'un analogue proteique chimerique multivalent de la Superfamille des Ig consistant es- 
sentiellement en un fragment variable (FV) d'une region d'une proteine de la Superfamille des Ig ayant deux sites 
de liaison k un ligand, comprenant les etapes consistant k : . 

35 (a) detemniner les points d'epissage des regions CDR-like pour former les segments fonmant le site de liaison 

suppiementaire au ligand k I'interleur des regions FR-like au fond d'un domaine.formant un tonneau p, rinser- 
tion de residus d'acides amines de la region CDR-like k I'interieur des residus de la region FR-like maintenant 
la structure repliee requise pour I'activite de liaison avec un ligand preselectionne ; 

(b) determiner la sequence d'acides amines de la construction resultante ayant un premier site de liaison au 
40 ligand et un second site de liaison au ligand ; 

(c) deduire la sequence d'ADN codant pour la sequence d'acides amines de (b); 

(d) synthetiser la sequence d'ADN ; 

(e) inserer la sequence d'ADN dans un vecteur d'expression approprie et exprimer le polypeptide dans un 
systems hdte approprie ; 

45 (f) isoler et purifier le polypeptide exprime; et 

(g) effectuer le repliement du polypeptide purifie dans sa conformation immunologiquement reactive, resultant 
en un analogue proteique chimerique multivalent de la Superfamille des Ig ; et eventuellement 

dans lequel la determination des points d'epissage est accomplle par ordinateur par utilisation d'une structure 
so tridimensionnelle gener6e par ordinateur d'un analogue proteique chimerique multivalent de la Superfamille des 
Ig, ou dans lequel la determination des points d'epissage est accomplie par I'alignement des sequences primaires. 

11. Procede pour realiser les interactions cellule-cellule comprenant radministration k un h6te d'un analogue proteique 
chimerique multivalent de la Superfamille des Ig selon la revendication 1 ayant un site de liaison capable de reagir 

55 avec un recepteur de surface celiulaire cible d'une premiere cellule et I'autre site de liaison capable de reagir avec 
un recepteur de surface celiulaire cible d'une seconde cellule dans des conditions selon lesquelles ledit analogue 
proteique se lie ^ ladite premiere cellule et k ladite seconde cellule, la liaison de ladite premiere cellule et la seconde 
cellule k I'analogue proteique chimenque resultant dans I'interaction entre les deux cellules. 



29 



EP 0 640 130 B1 

12. Commutateur moleculaire comprenant un analogue protejque chimerique multivalent de la Superfamille des Ig 
selon la revendicatlon 1 ayant un site de liaison initlant un changement conformationnel dans ledit analogue pro- 
t^ique chimerique lorsque ledit site de liaison est \\6 au ligand, le changement conformationnel permettant ^ I'ana- 
logue proteique chimerique de reagir comme commutateur moleculaires 

5 

13. Analogue proteique chimerique multivalent de la Superfamille des Ig selon la revendication 1 pour une utilisation 
en th6rapie ou en diagnostic, par exemple (a) I'imagerle d'un tissu spdcifique chez un hote ; ou (b) I'irradiation d'un 
tissu specifique chez un hote ; (c) la liberation d'une substance cytotoxique a un tissu speclfique chez un hote ; 
(d) la lyse de cellules cibles chez un hote ; (e) la modification de la fonction d'un recepteur de surface cellulaire 

10 d'un tissu spiciflque chez un h6te ; ou (f) la realisation d'Interactions cellule-cellule chez un hote. 

14. Utilisation d'un analogue proteique chimerique multivalent de la Superfamille des Ig selon la revendication 1 pour 
la fabrication d'un agent de diagnostic pour I'imagerie d'un tissu specifique chez un hote ; ou pour la fabrication 
d'un medicament pour (a) I'irradiation d'un tissu specifique chez un hote ou (b) I'irradiation d'un tissu specifique 

^5 chez un hote ; (c) la liberation d'une substance cytotoxique k un tissu specifique chez un h6te ; (d) la lyse de 

cellules cibles chez un hdte; (e) la modification de la fonction d'un recepteur de surface cellulaire d'un tissu spe- 
cifique chez un hdte; ou (f) la realisation d'Interactions cellule-cellule chez un hote. 

20 
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OXIDIZED x-1 



REDUCED x-1 




FIG. 16A 
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OXIDIZED x-2 MIXTURE OXIDIZED 26-10 sFv 




FIG. 16B 
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FIG. I7A 



26-10 sFv + X-2 protein 
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FIG. I7B 
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FIG. I7C 
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